I ATGGATTTTC TCCTGGCGCT GGTGCTGGTA TCCTCGCTCT ACCTGCAGGC 
51 GGCCGCCGAG TTCGACGGGA GGTGGCCCAG GCAAATAGTG TCATCGATTG 
101 GCCTATGTCG TTATGGTGGG AGGATTGACT GCTGCTGGGG CTGGGCTCGC 
151 CAGTCTTGGG GACAGTGTCA GCCTTTCTAC GTCTTAAGGC AGAGAATAGC 
201 CAGGATAAGG TGCCAGCTCA AAGCTGTGTG CCAACCACGA TGCAAACATG 
2 51 GTGAATGTAT CGGGCCAAAC AAGTGCAAGT GTCATCCTGG TTATGCTGGA 
301 AAAACCTGTA ATCAAGACGA GCACATCCCA GCTCCTCTTG ACCAAGGCAG 
351 TGAACAGCCT CTTTTCCAAC CCCTGGATCA CCAAGCCACA AGTTTGCCTT 
4 01 CAAGGGATCT AAATGAGTGT GGCCTGAAGC CCCGGCCCTG TAAGCACAGG 
4 51 TGCATGAACA CTTACGGCAG CTACAAGTGC TACTGTCTCA ACGGATATAT 
501 GCTCATGCCG GATGGTTCCT GCTCAAGTGC CCTGACCTGC TCCATGGCAA 
551 ACTGTCAGTA TGGCTGTGAT GTTGTTAAAG GACAAATACG GTGCCAGTGC 
601 CCATCCCCTG GCCTGCAGCT GGCTCCTGAT GGGAGGACCT GTGTAGATGT 
651 TGATGAATGT GCTACAGGAA GAGCCTCCTG CCCTAGATTT AGGCAATGTG 
7 01 TCAACACTTT TGGGAGCTAC ATCTGCAAGT GTCATAAAGG CTTCGATCTC 
751 ATGTATATTG GAGGCAAATA TCAATGTCAT GACATAGACG AATGCTCACT 
801 TGGTCAGTAT CAGTGCAGCA GCTTTGCTCG ATGTTATAAC GTACGTGGGT 
851 CCTACAAGTG CAAATGTAAA GAAGGATACC AGGGTGATGG ACTGACTTGT 
901 GTGTATATCC CAAAAGTTAT GATTGAACCT TCAGGTCCAA TTCATGTACC 
951 AAAGGGAAAT GGTACCATTT TAAAGGGTGA CACAGGAAAT AATAATTGGA 
1001 TTCCTGATGT TGGAAGTACT TGGTGGCCTC CGAAGACACC ATATATTCCT 
1051 CCTATCATTA CCAACAGGCC TACTTCTAAG CCAACAACAA GACCTACACC 
1101 AAAGCCAACA CCAATTCCTA CTCCACCACC ACCACCACCC CTGCCAACAG 
1151 AGCTCAGAAC ACCTCTACCA CCTACAACCC CAGAAAGGCC AACCACCGGA 
1201 CTGACAACTA TAGCACCAGC TGCCAGTACA CCTCCAGGAG GGATTACAGT 
1251 TGACAACAGG GTACAGACAG ACCCTCAGAA ACCCAGAGGA GATGTGTTCA 
1301 TTCCACGGCA ACCTTCAAAT GACTTGTTTG AAATATTTGA AATAGAAAGA 
1351 GGAGTCAGTG CAGACGATGA AGCAAAGGAT GATCCAGGTG TTCTGGTACA 
14 01 CAGTTGTAAT TTTGACCATG GACTTTGTGG ATGGATCAGG GAGAAAGACA 
14 51 ATGACTTGCA CTGGGAACCA ATCAGGGACC CAGCAGGTGG ACAATATCTG 
1501 ACAGTGTCGG CAGCCAAAGC CCCAGGGGGA AAAGCTGCAC GCTTGGTGCT 
1551 ACCTCTCGGC CGCCTCATGC ATTCAGGGGA CCTGTGCCTG TCATTCAGGC 
1601 ACAAGGTGAC GGGGCTGCAC TCTGGCACAC TCCAGGTGTT TGTGAGAAAA 
1651 CACGGTGCCC ACGGAGCAGC CCTGTGGGGA AGAAATGGTG GCCATGGCTG 

17 01 GAGGCAAACA CAGATCACCT TGCGAGGGGC TGACATCAAG AGCGTCGTCT 
1751 TCAAAGGTGA AAAAAGGCGT GGTCACACTG GGGAGATTGG AT TAG AT GAT 

18 01 GTGAGCTTGA AAAAAGGCCA CTGCTCTGAA GAACGCTAA (SEQ ID NO:l) 

FEATURES : 

Start Codon: 1 
Stop Codon: 1837 



Homologous proteins: 

Top 10 BLAST Hits: 



CRAj 
CRA| 
CRAj 
CRA| 
CRA| 
CRAj 
CRA| 
CRA| 
CRAj 
CRAj 



5000009 
4600010 
8400004 
8400004 
7800020 
1630000 
1570001 
1570001 
1000682 
6000004 



894329 
387291 
291633 
291633 
165689 
004929 
403280 
403334 
328934 
672465 



9 /altid=gi| 

8 /altid=gi| 
3 /altid=gi| 
1 / altid=gi [ 

9 /altid=gi I 
58 /altid=gi 

10 /altid-gi 
53 /altid=gi 

/altid=gi I 7 
0 /altid=gij 



15795193 /def 
15128103 /def= 
15430248 /def 
15430246 /def 
14763663 /def 
19506563 /def- 
16752658 /def 
113124888 /de 
512796 /def 
13929180 /def 



=ref |NP_ 
gb | AAK8 
gb | AAK9 
gb | AAK9 
ref |XP_ 
ref [NP_ 
=gb | AAF2 
f=ref |NP 
ir [ |T173" 
ref INP 



277060. Ij nep. 
4391.1 IAF3970. 
6011.11 (AY03. 
6010.11 (AY03. 
040394.11 epi. 
062270. lj EGF. 
7812.1 [AF1860. 

056322,21 ep. 
24 hypothetic. 
114014.11 fib. 



Score 




E 


. 1190 


0. 


0 


. 1147 


0. 


0 


1128 


0. 


0 


. 1084 


0. 


0 


397 


e- 


109 


396 


e- 


109 


396 


e- 


109 


395 


e- 


109 


385 


e- 


106 


147 


4e 


-34 



FIGURE 1A 



Blast hits to dbEST: 



CRA Number 


gi 


Number 


Score 


Expect 






CRA| 113000119195127 




14391046 


106E 


bits 


(539) 


0. 


0 


CRA | 110000024586745 


gi 


12320908 


866 


bits 


(437) 


0 . 


0 


CRAI87000000737360 


gi 


7312275 


842 


bits 


{425} 


0. 


0 


CRA| 113000119269070 




14399233 


831 


bits 


(419} 


0. 


0 


CRAf 113000119283425 


gi 


14400823 


827 


bits 


(417) 


0. 


0 


CRAU13000119283416 


gi 


14400822 


785 


bits 


(396) 


0. 


0 


CRA|112000057002060 


gi 


14377756 


680 


bits 


(343} 


0. 


0 


CRA[ 147000029699826 


gi 


11300389 


668 


bits 


(337) 


0. 


0 


CRA \ 117000088322182 


gi 


12462132 


563 


bits 


(284) 


le 


-157 



Expression (Tissue Source) : 



gi 


Number 


Organ 


gi 


14391046 


head neck 


gi 


12320908 


nervous tumor 


gi 


7312275 


head neck 


gi 


14399233 


head neck 


gi 


14400823 


head neck 


gi 


14400822 


head neck 


gi 


14377756 


colon 


gi 


11300389 


breast_normal 


gi 


12462132 


placenta normal 



FIGURE IB 



1 MDFLLALVLV SSLYLQAAAE FDGRWPRQIV SSIGLCRYGG RIDCCWGWAR 

51 QSWGQCQPFY VLRQRIARIR CQLKAVCQPR CKHGECIGPN KCKCHPGYAG 

101 KTCNQDEHIP APLDQGSEQP LFQPLDHQAT SLPSRDLNEC GLKPRPCKHR 

151 CMNTYGSYKC YCLNGYMLMP DGSCSSALTC SMANCQYGCD VVKGQIRCQC 

201 PSPGLQLAPD GRTCVDVDEC ATGRASCPRF RQCVNTFGSY ICKCHKGFDL 

251 MYIGGKYQCH DIDECSLGQY QCSSFARCYN VRGSYKCKCK EGYQGDGLTC 

301 VYIPKVMIEP SGPIHVPKGN GTILKGDTGN NNWIPDVGST WWPPKTPYIP 

351 PIITNRPTSK PTTRPTPKPT PIPTPPPPPP LPTELRTPLP PTTPERPTTG 

401 LTTIAPAAST PPGGITVDNR VQTDPQKPRG DVFIPRQPSN DLFEIFEIER 

451 GVSADDEAKD DPGVLVHSCN FDHGLCGWIR EKDNDLHWEP IRDPAGGQYL 

501 TVSAAKAPGG KAARLVLPLG RLMHSGDLCL SFRHKVTGLH SGTLQVFVRK 

551 HGAHGAALWG RNGGHGWRQT QITLRGADIK SVVFKGEKRR GHTGEIGLDD 

601 VSLKKGHCSE ER { SEQ ID NO: 2) 



FEATURES : 

Functional domains and key regions : 

Prosite results: 

PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation sire 

320-323 NGTI 

PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



fi Number of 


matches: 10 






157-159 


SYK 


5 i 


222-224 


TGR 


*? 3 


284-286 


SYK 


4 4 


354-356 


TNR 


1} 5 


358-360 


TSK 


•A 6 


362-364 


TTR 


i 7 


366-368 


TPK 


' 8 


531-533 


SFR 




573-575 


TLR 


:3 io 


602-604 


SLK 



PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 
Number of matches: 3 

1 213-216 TCVD 

2 392-395 TTPE 

3 453-456 SADD 

PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 
24 6-252 KGFDLMY 



PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 



of 


matches: 10 




1 


156-161 


GSYKCY 


2 


172-177 


GSCSSA 


3 


194-199 


GQIRCQ 


4 


238-243 


GSYICK 


5 


268-273 


GQYQCS 


6 


283-288 


GSYKCK 


7 


497-502 


GQYLTV 


8 


509-514 


GGKAAR 


9 


538-543 


GLHSGT 


10 


552-557 


GAHGAA 



FIGURE 2A 



PDOC00Q10 PS00010 ASX_HYDROXYL 

Aspartic acid and asparagme hydroxylation site 
Number of matches: 3 

1 151-162 CMNTYGSYKCYC 

2 233-244 CVNTFGSYICKC 

3 278-289 CYNVRG S YKCKC 

PDOC00016 PS00016 RGD 
Cell attachment sequence 

429-431 RGD 

PDOC00021 PS00022 EGF__1 
EGF-like domain signature 1 

92-103 CKCH PGYAGKTC 

PDOC00021 PS01186 EGF_2 
EGF-like domain signature 2 
Number of matches: 3 

1 92-103 CKCH PGYAGKTC 

2 160-17 4 CYCLNGYMLMPDGSC 

3 287-300 CKCKEGYQGDGLTC 

PDOC00913 PS01187 EGF_CA 

Calcium-binding EGF-like domain signature 
Number of matches: 3 

1 136-160 DLNECGLKPRPCKHRCMNTYGSYKC 

2 216-242 DV DE C AT GRAS C PRFRQCVN T FG S Y I C 

3 261-287 DI DECSLGQYQCSS FARC YNVRGS YKC 



Membrane spanning structure and domains 



Helix 


Begin 


End 


1 


3 


23 


2 


163 


183 


3 


397 


417 



Score Certainity 
0.942 Putative 
0.765 Putative 
1.209 Certain 



SignalP results: 

< Is the sequence a signal peptide? 

# Measure Position Value Cutoff Conclusion 
max. C 20 0.657 0.37 YES 

max. Y 20 0.713 0.34 YES 

max. S 6 0.960 0.88 YES 

mean S 1-19 0.901 0.48 YES 

# Most likely cleavage site between pos . 19 and 20: AAA-EF 



FIGURE 2B 



BLAST Alignment to Top Hxt: 

>CRA|50000098943299 /altid=gi I 157 951 93 /def-ref i NP_277 0 60 . 1 
nephronectin [Mus masculus] /org-Mus rausculus 
/taxon=10090 /div-ROD /dataset-nraa /length-609 
Length = 609 



Score = 1190 bits (3045), Expect =0.0 

Identities = 536/610 (87%), Positives - 569/610 {92%} , Gaps = 3/610 (0%) 



Query : 


1 


MDFLLALVLVSSLYLQAAAEFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 


60 






M LLA VL SSLYLQ AA+FDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 




OJJJ LL . 


i 
± 


MAVLLAAVLASSLYLQVAADFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 


60 


Query : 


61 


VLRQRIARIRCQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHI-PAPLDQGSEQ 


119 


oD] c c : 




VLRQR+ARIRCQLKAVCQP+CKHGEC+GPNKCKCHPG+AGKTCNQDE P PLDQGSEQ 




C 1 

bl 


VLRQRLARIRCQLKAVCQPQCKHGECVGPNKCKCHPGFAGKTCNQDESFHPTPLDQGSEQ 


120 


Query : 


120 


PLFQPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSALT 


17 9 






PLFQP DHQAT++PSRDLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+PDGSCSSAL+ 




Sb j ct : 


1 O 1 


PLFQPPDHQATNVPSRDLNECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCSSALS 


180 


Query: 


180 


CSMANCQYGCDVVKGQIRCQCPSPGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGS 


239 






CSMANCQYGCDVVKGQ+RCQCPSPGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGS 




Sbjct: 


181 


CSMANCQYGCDWKGQVRCQCPSPGLQIiAPDGRTCVDIDECATGRVSCPRFRQCVNTFGS 


240 


^ Query: 


240 


YICKCHKGFDLMYIGGKYQCHDIDECSLGQYQCSSFARCYNVRGSYKCKCKEGYQGDGLT 


2 99 






YICKCH GFDLMYIGGKYQCHDIDECSLGQ+QCSS+ARCYN+ GS YKC+C++GY+GDGL 




HI. Sbj ct : 


2 41 


YICKCHTGFDLMYIGGKYQCHDIDECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLN 


300 


Query: 


300 


CVYIPKVMIEPSGPIHVPKGNGTILKGDTGMNNWIPDVGSTWWPPKTPYIPPIITNRPTS 


359 






CVYI PKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPYIPP+ITNRPTS 




Sbjct: 


301 


CVYIPKVMIEPSGPIHMPERNGTISKGDGGHANRIPDAGSTRWPLKTPYIPPVITNRPTS 


360 


l„.u Query: 


360 


KPTTRPTPKPTPI PTPPPPPPLPTELRT-PT.PPTTPFRPTTf^T TTTflPaziQTPDrrTwn 


4 1 o 






KPTTRPTP PTP PTPPPPPPLPTE RT PLPPT PERP+T TTIAPA ST ITVD 




Sbjct: 


361 


KPTTRPTPNPTPQPTPPPPPPLPTEPRTTPLPPT-PERPSTRPTTIAPATSTTTRVITVD 


419 


%j Query: 


419 


NRVQTDPQKPRGDVFIPRQPSNDLFEIFEIERGVSADDEAKDDPGVLVHSCNFDHGLCGW 


478 






NR+QTDPQKPRGDVFI PRQP+NDLFEI FEIERGVSAD+E KDDPG+L+HSCNFDHGLCGW 




«■ ■ Sbjct: 


420 


NRIQTDPQKPRGDVFIPRQPTNDLFEIFEIERGVSADEEVKDDPGILIHSCNFDHGLCGW 


479 


Query: 


479 


IREKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRHKVTG 


538 






IREKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLSFRHKVTG 




Sbjct: 


480 


IREKDSDLHWETARDPAGGQYLTVSAAKAPGGKAARLVLRLGHLMHSGDLCLSFRHKVTG 


539 


Query : 


539 


LHSGTLQVFVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSVVFKGEKRRGHTGEIGL 


598 


Sbjct: 




LHSGTLQVFVRKHG HGAALWGRNGGHGWRQTQITLRGAD+KSV+FKGEKRRGHTGEIGL 




540 


LHSGTLQVFVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKSVIFKGEKRRGHTGEIGL 


599 


Query: 


599 


DDVSLKKGHC 608 








DDVSLK+G C 




Sbjct: 


600 


DDVSLKRGRC 609 (SEQ ID NO: 4) 





FIGURE 2C 



>CRA| 46000103872918 /altid=gi | 15128103 /def=gb | AAK84391 . 1 I AF397 007_1 
(AF397007) nephronectm [Mus musculus] /org=Mus musculus 
/taxon=10090 /div=ROD /dataset=nraa /length-592 
Length = 592 

Score - 1147 bits (2934), Expect - 0.0 

Identities - 520/610 (85%), Positives = 552/610 (90%), Gaps - 20/610 (3%) 

Query: 1 MDFLLALVLVSSLYLQAAAEFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 60 

M LLA VL SSLYLQ AA+FDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQP 
Sbjct : 1 MAVLLAAVLASSLYLQVAADFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQP-- 58 

Query: 61 VLRQRIARIRCQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHI-PAPLDQGSEQ 119 

VCQP-f CKHGEC+GPNKCKCHPG+AGKTCNQDE P PLDQGSEQ 
Sbjct: 59 VCQPQCKHGECVGPNKCKCHPGFAGKTCNQDESFHPT PLDQGSEQ 103 

Query: 120 PLFQPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSALT 17 9 

PLFQP DHQAT++PSRDLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+PDGSCSSAL+ 
Sbjct: 104 PLFQPPDHQATNVPSRDLNECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCSSALS 163 

Query: 180 CSMANCQYGCDVVKGQIRCQCPSPGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGS 239 

CSMANCQYGCDVVKGQ+RCQCPSPGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGS 
Sbjct: 164 CSMANCQYGCDVVKGQVRCQCPSPGLQLAPDGRTCVDIDECATGRVSCPRFRQCVNTFGS 223 

Query: 240 YICKCHKGFDLMYIGGKYQCHDIDECSLGQYQCSSFARCYNVRGSYKCKCKEGYQGDGLT 299 

YICKCH GFDLMYIGGKYQCHDIDECSLGQ+QCSS+ARCYN+ GSYKC+C++GY+GDGL 
Sbjct: 224 YICKCHTGFDLMYIGGKYQCHDIDECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLN 283 

Query: 300 CVYIPKVMIEPSGPIHVPKGNGTILKGDTGNNNWIPDVGSTWWPPKTPYIPPI ITNRPTS 359 

CVYIPKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPY I PP+ ITNRPTS 
Sbjct: 284 CVYIPKVMIEPSGPIHMPERNGTISKGDGGHANRIPDAGSTRWPLKTPYIPPVITNRPTS 343 

Query: 360 KPTTRPTPKPTPIPTPPPPPPLPTELRT-PLPPTTPERPTTGLTTIAPAASTPPGGITVD 418 

KPTTRPTP PTP PTPPPPPPLPTE RT PLPPT PERP+T TTIAPA ST ITVD 
Sbjct: 34 4 KPTTRPTPNPTPQPTPPPPPPLPTEPRTTPLPPT-PERPSTRPTTIAPATSTTTRVITVD 4 02 

Query: 419 NRVQTDPQKPRGDVFIPRQPSNDLFEIFEIERGVSADDEAKDDPGVLVHSCNFDHGLCGW 478 

NR+QTDPQKPRGDVFIPRQP+NDLFEIFEIERGVSAD+E KDDPG+L+HSCNFDHGLCGW 
Sbjct: 403 NRIQTDPQKPRGDVFIPRQPTNDLFEIFEIERGVSADEEVKDDPGILIHSCNFDHGLCGW 462 

Query: 47 9 IREKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRHKVTG 538 

IREKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLSFRHKVTG 
Sbjct: 4 63 IREKDSDLHWETARDPAGGQYLTVSAAKAPGGKAARLVLRLGHLMHSGDLCLSFRHKVTG 522 

Query: 539 LHSGTLQVFVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSWFKGEKRRGHTGEIGL 598 

LHSGTLQVFVRKHG HGAALWGRNGGHGWRQTQITLRGAD+KSV+FKGEKRRGHTGEIGL 
Sbjct: 523 LHSGTLQVFVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKSVIFKGEKRRGHTGEIGL 582 

Query: 599 DDVSLKKGHC 608 
DDVSLK+G C 

Sbjct: 583 DDVSLKRGRC 592 (SEQ ID NO: 5} 



FIGURE 2D 



>CRA|84000042916333 /alt id-gi I 1 54 3 02 4 8 /def=gb i AAK9601 1 . 1 i 

(AY035899) nephronectin long isoform [Mus rnusculus] 
/org-Mus rnusculus /taxon-10090 /div=R0D /dataset=nraa 
/length-578 
Length = 578 



Score = 1128 bits (2885), Expect = 0.0 

Identities - 512/609 (84%), Positives - 543/609 (89%), Gaps - 32/609 (5%) 



Query: 


1 


MDFLLALVLVSSLYLQAAAEFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 


60 






M LLA VL SSLYLQ AA+FDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 




Sbjct: 


1 


MAVLLAAVLASSLYLQVAADFDGRVvTPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 


60 


Query: 


61 


VLRQRIARIRCQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHIPAPLDQGSEQP 


120 






VLRQR+ARIRCQLKAVCQP+CKHGEC+GPNKCKCHPG+AGKTCNQ 




Sbjct: 


61 


VLRQRLARIRCQLKAVCQPQCKHGECVGPNKCKCHPGFAGKTCNQ 


105 


Query : 


121 


LFQPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSALTC 


180 






DLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+PDGSCSSAL+C 




Sbjct: 


106 


Di^NECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCSSALSC 


150 


Query: 


181 


SMANCQYGCDWKGQIRCQCPSPGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGSY 24 0 


.; • s j 




SMANCQYGCDVVKGQ+RCQCPSPGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGSY 




^[| Sbjct: 

' !, ;A 


151 


SMANCQYGCDVVKGQVRCQCPSPGLQLAPDGRTCVDIDECATGRVSCPRFRQCVNTFGSY 210 


fl\ Query: 


241 


ICKCHKGFDLMYIGGKYQCHDIDECSLGQYQCSSFARCYNVRGSYKCKCKEGYQGDGLTC 300 






ICKCH GFDLMYIGGKYQCHDIDECSLGQ+QCSS+ARCYN+ GSYKC+C++GY+GDGL C 




; : Sbjct: 


211 


ICKCHTGFDLMYIGGKYQCHDIDECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLNC 


270 


%\ Query: 


301 


VYIPKVMIEPSGPIHVPKGNGTILKGDTGNNNWIPDVGSTWWPPKTPYIPPIITNRPTSK 


360 






VYIPKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPYIPP+ITNRPTSK 




i , Sbjct : 


271 


VYIPKVMIEPSGPIHMPERNGTISKGDGGHANRIPDAGSTRWPLKTPYIPPVITNRPTSK 


330 


h*$ Query: 


361 


PTTRPTPKPTPIPTPPPPPPLPTELR-TPLPPTTPERPTTGLTTIAPAASTPPGGITVDN 


419 






PTTRPTP PTP PTPPPPPPLPTE R TPLPP TPERP+T TTIAPA ST ITVDN 




%\ Sbjct: 


331 


PTTRPTPNPTPQPTPPPPPPLPTEPRTTPLPP-TPERPSTRPTTIAPATSTTTRVITVDN 


389 


Query: 


420 


RVQTDPQKPRGDVFIPRQPSNDLFEIFEIERGVSADDEAKDDPGVLVHSCNFDHGLCGWI 


479 






R+QTDPQKPRGDVFIPRQP+NDLFEIFEIERGVSAD+E KDDPG+L+HSCNFDHGLCGWI 




Sbjct: 


390 


RIQTDPQKPRGDVFIPRQPTNDLFEIFEIERGVSADEEVKDDPGILIHSCNFDHGLCGWI 


449 


Query: 


480 


REKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRHKVTGL 


539 






REKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLS FRHKVTGL 




Sbjct: 


450 


REKDSDLHWETARDPAGGQYLTVSAAKAPGGKAARLVLRLGHLMHSGDLCLSFRHKVTGL 


509 


Query: 


540 


HSGTLQVFVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSVVFKGEKRRGHTGEIGLD 


599 






HSGTLQVFVRKHG HGAALWGRNGGHGWRQTQITLRGAD+KSV+FKGEKRRGHTGEIGLD 




Sbjct: 


510 


HSGTLQVFVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKSVIFKGEKRRGHTGEIGLD 


569 


Query: 


600 


DVSLKKGHC 608 








DVSLK+G C 




Sbjct: 


570 


DVSLKRGRC 578 (SEQ ID NO: 6) 





FIGURE 2E 



Hmmer search results (Pfam) : 



Scores 


for sequence family classification (score 


includes all 


domains) : 




Model 


Description 


Score 


E-value 


N 


CE00409 


E00409 fibulin 


108.3 


3.8e-30 


3 


PF00629 


MAM domain. 


88.2 


1.7e-22 


1 


PF00008 


EGF-like domain 


68.0 


1.7e-17 


5 


CE00283 


E00283 selectin 


13.0 


0.014 


2 


PF01278 


Omptin family 


1.9 


6.6 


1 



Parsed 
Model 


for domains : 
Domain seq-f 


seq-t 


hmm-f 


hmm-t 


score 


E-value 


PF00008 


1/5 


77 


103 . 


1 


45 [] 


17 


3 


0.0036 


CE00283 


1/2 


74 


105 . 


119 


153 . . 


10 


7 


0.059 


PF00008 


2/5 


140 


172 . 


1 


42 [ . 


19 


4 


0.00091 


PF00008 


3/5 


180 


214 . 


1 


45 [] 


19 


0 


0.0012 


CE00409 


1/3 


136 


227 . 


415 


514 . . 


69 


4 


4.5e-19 


CE00283 


2/2 


226 


248 . 


124 


146 . . 


8 


7 


0.2 


PF00008 


4/5 


220 


249 . 


1 


41 [- 


10 


8 


0.25 


CE00409 


2/3 


232 


273 . 


429 


473 . . 


35 


2 


2.3e-09 


CE00409 


3/3 


277 


298 . 


344 


365 . . 


16 


3 


0.00053 


PF00008 


5/5 


265 


300 . 


1 


45 [] 


31 


7 


2.9e-07 


PF01278 


1/1 


477 


493 . 


241 


257 . . 


1 


9 


6.6 


PF00629 


1/1 


469 


610 . 


1 


170 [] 


88 


2 


1.7e-22 
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1 


GAGAAAATTG 




51 


TATCAAAAGT 




101 


TTTTGATCTC 




151 


TCACTCCCAT 




201 


TAGTTGCAGC 




251 


TTATTCCATG 




301 


GGGAGAAGCT 




351 


TTTNNNNNNN 




401 


NNNNNNNNNN 




451 


CTAATGTCTC 




501 


CATTAAGAGC 




551 


ACTAGTCTCA 




601 


GGCTAAATTT 




651 


AAGAGGAATG 




701 


GGCCAAAGCA 




751 


ATTACACTTT 




801 


GTCTTTTCCT 




851 


TCTAATCATC 




901 


CACCAACTAA 




951 


ACCTGGAACC 




1001 


TATTTTGACT 




1051 


GACACCAAGC 


: ;:b:i 

h i 


1101 


ATCTTTCTTG 


1151 


TAGAAATCAA 




1201 


AAGAAAATTC 


$ 


1251 


CCTACCACAG 


U. I. 

'H 

■* 


1301 


AGAAAATGAA 


ill 


1351 


NNNNNNNNNN 




1401 


NNNNNNNNNN 




1451 


NNNNNNNNNN 




1501 


NNNNNNNNNN 




1551 


NNNNNNNNNN 




1601 


NNNNNNNNNN 




1651 


NNNNNNNNNN 




1701 


NNNNNNNNNN 


ri 


1751 


NNNNNNNNNN 


\m 


1801 


NNNNNNNNNN 


%, I 


1851 


NNNNNNNNNN 


J- 
Ul 


1901 


AAGGCTGTCA 




1951 


CATGTACATC 




2001 


ACAATACTTT 




2051 


TCTAAGTGAT 




2101 


AGCCCTTCCT 




2151 


AATGTGCTCG 




2201 


TTCTTCAAAA 




2251 


AATAAAAAAT 




2301 


TTTGCCTTCG 




2351 


ACACTGCTCA 




2401 


CCCTCCGGTG 




2451 


GGAACCTGCG 




2501 


CCCCTTTCCT 




2551 


ACTTTTCTCC 




2601 


ACTGCCGTCC 




2651 


CCCGGGAGGA 




2701 


CGAGTCGGGC 




2751 


GGGCGGTGCA 




2801 


AGGGGGCTCC 




2851 


CGCTGTCCTC 




2901 


TTCCTCGAGA 




2951 


CCTGTTCCTC 



AGATTACTAC CTGCAAGGTG 
TTGTCCTCCT GAAAAAGTAG 
ATTCTTGCTC ATTTGTTTTT 
GGCAACTCTG CCTCTTTTTC 
ATAATAACCA GTTAGACCTT 
ATTTAACAAA GACTGATATA 
AAAGTTCACA GGCAGAATTC 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
ACGGGACCCA GAGTGGTAGG 
TACCTTAGAG CATGCTAAGT 
GCCAATCAAA AAGTCAGAAA 
AAACTCAAAG GGTAAGGGGA 
AAGAGGAGGC TAAATTTAAA 
AATATATGGA AAACAAAAGA 
TCCCTCCAGT TATTTATCCT 
TAGATCTCCA GTTTTTTAGC 
CATTGCACAC AGTGAGGTTT 
CCAACCAACA AACAAAAACA 
ATGCCTACAC T T AC AAAAAT 
CCTACATTTT TGGACTTCAT 
TGTTTGCCTA AAATAATGTC 
AGATTAAAAT AGAAACTTGT 
AGAGCCATTT AAGTTGAAAC 
CTGTTTTTAC ACACACTGAA 
AACTTTCCAT TAAAAATTTG 
GATGGGGGGA AAACGTTTGT 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
TTCGAGGTAA GGTTGGGAAG 
TTAGGATAAT CATGTTTTCA 
TTACTCTGCA ACAACTGAAC 
CCCCTCAAGA TCAGGCACCA 
CTCTGGCAGG CACTCATCCC 
GGAGTTGAGC TGATAGAGAC 
CCCGACTGGA GAGCCAAATA 
ATGCCTTTGC CAACTCTGCG 
CTAATTTGTC CAAGTCGTGC 
GCTCCAGCCC CATAAACCTC 
AAAGCTACTG CTCGCCCCTG 
GAGCGTGCAC CTACGCCTCG 
GCTGGGCACC CTGCTTTCCC 
CAGGCTGGGG CTCGCCCTGC 
CGCAGCGCCC CTGCCCCCGA 
GCGGAGCGCG CTCACCCTTC 
GCACGCGCAC CCCCTGCCCG 
GCTACCCCTG CAGCGCCTCC 
GGGCGCCGCG CAGCAGACCT 
CGGGAGCGGC AGCAGTAGCC 
CTCTCAGAGG GGCGCCTCCC 
GCGCGCCACT GCGCTGCGCC 



TCATTACCTG GTAAGAAGCC 
TTATTGCTAA AAGCTAGCTG 
AAGACTGAGA TAATGAAATG 
GGAATGATCA TTGGTGGTCA 
GGAAATCCTT TAGATTCTCC 
ATTAGCTACA TTTTACTGAA 
AATTTAATCC AATCCATCTG 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNCT ATTGGTTAGA 
GAGGCAATGA CAACACAATA 
GTAATAGAAG TATATAAAAC 
AGGCTTCCTG GGGGACTGTC 
GATTGTCCAA ATGAAGAAGG 
TGAAGAAGGA GTGTTTTCTA 
AAGACGCATG TAGATGGGGC 
GTTTTCATCC ACCACTCTTC 
CGTATATTAC CCCCTTTCTC 
ATTTGTAAAT CTAACCCAGG 
GCTAAGGAGT AGCCACTGGA 
TTGATTTTCT GCAGAAAAAA 
TAGAAGGACC TGAAATGGAT 
CCAAGCCTGA ATTGGCATGG 
TTTGCTAACT GAAAACAACT 
CATTATTTTT CCTTTCCTTG 
TGATCAGGAT AGTGAATCAC 
AAGTTGTAGA AACCTCAAAA 
AATGTAGCAA ATGANNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNAGATG ATCACTGCAC 
AGGGGTGACT TGATTGGGCT 
TTTGAACACA GTCATTTAGT 
AAT GAT TACT TAAAATGGTT 
CAACACTACT AGGCAGTAGC 
TAGAGGGGCA GTCCTGTCCT 
AGAAACAGTC TGCTCATTTC 
TTTCTTTTGG TTAGTTACAA 
GGAAAAGCAA ACTCCCAGAG 
TGTGGTTTTA GTAAATGCAG 
TCTGCTCTAG GGCTTCTCCG 
CAGTCACCAC CTGTTCGGGC 
GGCTCCTTTC CTCCTCCACT 
TCTCCCAGAG AGGGTTTGCA 
TTGGCTAACC CCCAAGAGCC 
GTTGCCTGCC CCGCTGGGCC 
GCCCGGGGCT GGGAGGGCGG 
CCCCTGGCGC CCCTCCCCGC 
CCTAGCTAGA AGGGAGCGGG 
GCTCCGGCCG CGCGCCTCGC 
CGGGCGGCGA GGGCTGGGGG 
ATCGGCGCCC ACCACCCCAA 
CCAGGACCCG CTGCCCAACA 
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3001 


TGGATTTTCT 




3051 


GCCGCCGAGT 




3101 


CTTCCCGCGC 




3151 


GGTTTCGTGG 




3201 


ACAGCGTGGC 




3251 


GCGGAGAGGG 




3301 


AGGGCGACTC 




3351 


GGAGCCTGGG 




3401 


GTGACGCGCG 




3451 


CGGCTCTTCA 




3501 


TGCTCAGGGC 




3551 


ACTGGGCTAC 




3601 


GAAGGAGACT 




3651 


CATGAGTGTG 




3701 


AGACTCGGGC 




3751 


CGGGCCGCGT 




3801 


GCTCCGGTGT 




3851 


CCCTCACCTC 




3901 


TGCTGCGAAA 




3951 


CCTTCAGCGA 




4001 


TAAAGAAAGA 




4051 


TCCACAATTG 




4101 


GTGGAGCGCC 




4151 


AAATAGAAAG 


M 


4201 


CAGGTAAAAG 




4251 


GTGATAGCAG 




4301 


CTGTGCTCAC 




4351 


CAAGAAACCC 


ft is if 


4401 


TGTAACTTTA 




4451 


GAGGAAAAAA 


J| 


4501 


AATAATGTGT 


^1 


4551 


CTTTCTTTAG 




4601 


ATCTAAGCAC 




4651 


TAAACTCTCC 




4701 


TTTTGTTTCT 


::! 


4751 


TTTTGAGAAA 




4801 


GAAATTTTTT 


*'l 


4851 


TGAATCAGTC 




4901 


GGAATCTGTT 




4951 


TGAGAATTGT 


-| 


5001 


TCCCTCCCGT 




5051 


ACTGTCTGAA 




5101 


GTTTTAACCT 




5151 


GGGTGTGGTT 




5201 


TCTTCTCAAA 




5251 


GCCCAGGCAA 




5301 


TTGACTGCTG 




5351 


GAGTATCAAG 




5401 


TTGTTCATGG 




5451 


TTGGCCTTGC 




5501 


TATTCTGCTT 




5551 


CAT TC AC AG A 




5601 


ATCTGCATCT 




5651 


NNNNNNNNNN 




5701 


NNNNNNNNNN 




5751 


AGATGGGGAT 




5801 


TAATGAGTCT 




5851 


CTTATATAAT 




5901 


CCTTTTTTGG 




5951 


TATAATGAAA 



CCTGGCGCTG GTGCTGGTAT 
TCGACGGGAG GTGAGCTGGG 
TAATTTCACA CTCACTGTCT 
TCAGAGAGGC GTCTCCTCCA 
GCGAGGAGAG CGGTCCAGCG 
CGCGCCCTTG CGAGTCTGGG 
GCCCCGGCTC GGGAATTAGG 
ATCTCGGCTC TGAGGGCGCG 
AAACATCCCT TACCCGGGAA 
CTGGGGAAGC TTCCAGGCCC 
TCTCGGGGCC GCTCACACAG 
GGGGGAGATT TGTGGGCCTC 
CAAAATGAGG ACCGGAGGGT 
TGGATGTGTG TGTGAGAGCG 
GAGCCTGACG GCGGCGTGCT 
CTCCGCTGTC ACTCAGCCGG 
GTGAGAGACA CTGGGTCTGT 
TGTGCAATTA CAGACTAGGC 
AGAGGTTTTA CTCTGGCGCA 
CCTCTGCCCC CACCCCTCCG 
TGCCGCACCT GTTTTACCTT 
ATTCGCTTTC TTGTACCCGG 
AAGAAAAAGC ACAGCTGCTT 
TTCGGGGTTA TTTTGTACTT 
AAAAGGGGAA TTGAAAAAGA 
CCAGAGTGTA GACCTTTTTA 
ATTTGGGTAT GTTATGCATT 
ATCACAAGAT ATGCATATGA 
AAAAGATGTC TAAAACAGCA 
GTCATTTAGG TGTGACTCTA 
CAACGTGTAA TCAGGTGGAT 
GCATTATTTA CAT TAG AG AG 
GTTTCAAACC ACAAGGATAC 
TTGGTTTTAT TTAGACTTTT 
GCCACAGTTA TGTCTCAAAA 
CCTTTCTGCT CCTTCCCCCA 
CTGCTTTGAC ATGTCTTACT 
AAACCTGGCT TTCACCCCAG 
TTATTTCAAG CTTCAGAAAC 
TGGTGTTAGC TGGTGCTATT 
TTTTTCTCCT ACCCTTGTGT 
ATTTGACATT AGATCATAGC 
TAAATTTCTG CAGGCATAAA 
GACATTTTTT CTGAAGTAAT 
AGTGTCCTTA TTTATTTTGA 
ATAGTGTCAT CGATTGGCCT 
CTGGGGCTGG GCTCGCCAGT 
CCTGGGGACT TCAGTTCCCT 
CTTCACCCCA CATATCAGAG 
AGGTCTGACT TGGGGATTTT 
CTTTTCAGCT TTAGCCACCT 
GAGCCACCTT AAATGTTTCT 
CCTTCCTGTT TTCTTACCAG 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNCTGA 
AAGATAATGA GTCTAGACCG 
AGACCGTATT TACACAAAAT 
CCTCACCATA GCTGTTGTGT 
ACTGGGGTTG TATTGACAGT 
CAAAGCCACT GAAACAACAT 



CCTCGCTCTA CCTGCAGGCG 
CCCCGGGGCG CCCTCTCCTC 
TGGGTCACTT TTCCCCGCGG 
TCCAGAAGTT GGGCCACCGC 
GCTCCGAGTG CCCGCCCGAG 
ACCCCATCCG CGGCCCCCCG 
ACTGAGGGAG AGGAGCCGCT 
GTTTAGCCAC CTACGCCGAG 
ACTCCCGCGC CTGAACTAGA 
CCGGGGGGAG GCCCGGGCTC 
AGAGTGGGTG CGAGTCAGCG 
TCCATTTGGT TTTCTTGAGG 
GGGCGCTCCG TGAATGTGAG 
CACACACTGC GCCGCTCCTC 
GTGACAGGTT CCAACAACCT 
TCCTCCCGCG CTCCGGGGCC 
CGGGAGGGTG TGCTCGGTCC 
TCGTCCCGGG TGCAGATGGC 
CACCGTCGCC CGGTCGCCTT 
TGTAGGTCCC TGCCTAGTCC 
TAATCTTGGA ACGAATCAAT 
AGGCGAACGG AGGGGAACCA 
GTTCACTACC TGGTCGGAGG 
TGGGCCTGGG GGTAGAAAGG 
TAAGGAGACT TTAAGAAAAC 
TTTTATTTTT AAAAGCAATT 
TCTTGCACAC ATATTTGCAA 
TGTGAATGCA TATAGCTTTT 
AATTAAAATG TTGATACCCA 
AACAAGGAAA CAATTTAGCA 
AAATCTGTAT CCTGAAATTA 
GAAAATACTA TGAATTGTTG 
TTGAGATATC AGCTACACTT 
TTCTAGTACT TTTTTGTTTT 
AGAGCTCTGT TACACTTGAT 
TCCACCTTTT ATATGAGGCA 
TAATACTTTT CAGTTTATGG 
TAACAGGTGT GGGTCTTTGA 
TATCCTGTGA GTGGCAGCCT 
TCTACTTAAG AAAGCATCTT 
GTTGGGGGAT GGGGTATTTA 
AGAACTATTT CTGTGAAGGT 
AGAGTTTGTA GAAAATTGTT 
AATACAGAGG AAATTACCTT 
ATCTTTTTTT TTGGTAGGTG 
ATGTCGTTAT GGTGGGAGGA 
CTTGGGGACA GTGTCAGCGT 
GGGAGGTGTG GCTTTCCACC 
GGTTCATTAC TGAGCAAGGC 
CAGGTACAGT CCAGACTCCT 
GTATTACGGC CCAGCTTTGT 
TTTAGCATTT TTCCCCCCAA 
GTTTATTNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
CTCGCTTGCT CCAGGCTGGG 
TATTTACACT GGGGATAAGA 
AAACTCTGAA CCTTAGAGTA 
AAAGTGGCCC ATTTGTAAAT 
GTTTTAAATG GACCACCCAG 
TATTTGTAAG TTCTATAAGT 
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6001 AACCAACCTC ATGTAATAAG TACCAAATCA GAAAGCTGAT TTTCACCTTT 
6051 TCTGTGACCA CAAATGGATA TTTTAATGGT AAAATTTAGA GCTACATCAA 
6101 AAGAGTTTGA GAAATTTAGG AAATTTGTCC CCAGCATTTT ATCGCGATCT 
6151 TAAAATTGTA TCTCACTCCT ACTGCAAAAA AATAGTCTTC AAATGATCAA 
6201 GTACCTTTCC AGAGCACCCT TAGAGTGCTT GGGGCGGGGG TAGGGGGCAC 
6251 TTCTGGTAAG ATGATGGGAA CTAAGTTGGG TTCTACATTG GGATATATAT 
6301 TTTATTGCTA AT GAG GAG G A GGCTTAGAGG AAGAGAGAAG GGCAGTTACG 
6351 AAGGCTAGAG CTGGCAATGG AGAAGCCTGC CTTAGAGATG GGTTGCTAGT 
64 01 GTGAGGAGTC AGGCAAATTT AAGTTCAGGA AAGTTAGGAG TTCCCTCTGC 
64 51 TATTTTAATT TTTGAGGATG CTTGCAATGT CTTCCTTAAT TTTGTGAAAG 
6501 AGGGACAGTG ACAGTCACAG ATTGACTCTA ATTGCACATA AAGACCACAA 
6551 TCTCTGGTTG GGAATAGAAA GGTAAAGGAA ATGAATGTTT GCCTACCTGG 
6601 TATGGAATTT GAGAACCAAC AGATTCTAAT AACCAAAATG TGAAGAAAGG 
6651 ACCCTTCTGT TGGCCCAACA CACCTACACA TAACCCTCCT GAGTGAAAAA 
67 01 TGAGTAGTTC TATACCTGCA GTCTCCAGCT GTGCAAATAC TTCTGATACT 
6751 ACAGAAGACT AAATTCCACC AGGCACCATT CTTCTTTTTT GATCATCTTC 
6801 CCTTAAAACA ATATTGAATA GACTAACCAG TGAGTGTACA GCAGCTTTGC 
6851 CTACTTCTTT TGTTTACTGG AAACTGGAGT TACCACCATC TCCCTTTAAC 
6901 AGAATGTAAT TGACCCCCCG TGCAAAGGGT CTAGCCAAGC AGCTCGCACA 
6951 TAGCAGTTAG GCAAACATTC GTTTGCTTCA CCTTTTATGT TTATACTTCC 
7001 AAAATCAAAG TAGTTCCAAG GTTTTCTATG TTACAGTGAA ATAAAATCCC 
7 051 TTTCAATTAA AAAGGCACAA ATGGTTCTTT ACTATTAACA TTGAAGTTGG 
O 7101 TGAGGTTGTA AAAATTAGCT CAAAGGTGAA TGTTTCCTTC TGTGTATTTT 
/>! 7151 ATTTTTCCAG CATCTTAGAT GGCGCAAATG TCTCTTGTCA GTTCAGAGTT 
7201 CTGCCTGTCT TTGTTTTGAT ATAAGCAGGT AGAGGAATGT GGGGCTGAGA 
;y 7251 AGTAAGCTTG AAGGGGCAGA AC AAAC C AAA AAGAGGCTGA TCAGATTGAA 
\\ 7301 TGAAATATCT CTGAAAACTC TTGATTATTT TAAAGAAAGT CTTTATGAAA 
m 7 351 TTAAAAGTTT TGTCACTACG TTTGTTCAAG AAAATGCCTT GCTATTGTAT 
u . 7 4 01 AACAATTCAA TCTAATATGA TTCCTTATAA AGATTCCAAA GAACTTCTAG 

7 4 51 CGAATTTAAT GTGAGAAATG TTTTTGCCTT TTCGACCTTT AGATAATCAT 
7 501 GTAGTTCTTT CCCATAAGGA AGGGCTATTC TCCCTTCCTC ATCAGAGGTG 
%l 7 551 CTCTGGTTCT CTTCTTTTCT GAATGATTCA CTTTGGAATT TTCCTTCAAA 
;s 7 601 ACAGCATAGC AAAACAAAAA GAAACTATTC CCATTACTGC ATAGATCTTC 

^ 7 651 CCAACTTATC CCATAGAAGG TGTGTTTGTA GGTAGGGAAA GGTGGTGGTA 
7 7 01 CCCTCATTAT ATTATTTAAC AGACCTTATG CCTTAGTGCA GTGACCCTTT 
77 51 GGTGAGTGTT TATTCCGTTT GGTTGAGCAT TTGTGAGATG TTTGACTTAA 
H 7 801 TATTCATGTG AGTCAAATGT ACGTATCTGT ACAAAACATG CTGCCCTTCA 
%\ 7 8 51 TTTTCACTTG CTATCTTCCT ATCCATGGTC TTGTTTGGAG AAACCGACTA 
tm 7 901 ATGTTGCAGG ATGCTAAAGC TGGTAGACCT CTCCTTCTGG CTCACTATGT 

7 951 CTAAGCAGAG CCAGATATAG CTGGGAAACT TTATATCCTT CCCTCTAGGA 

8 001 CTCAGGAAGG AAGGATCAGA GATGCTACTC AAATGGGCAT AGAACCTGTC 
8051 CTGCTGCTTC CTGCCTGTAC CACCCAACAT TCTATCCCAA CATTCCTGCC 
8101 TGCTAGGGAA AGCATGACCC GTTCCAGAAA TAGGCAGGTT GTGTTTTCAT 
8151 AGTTCTTTTG TAGGTCATAC CCTTGGTTGN NNNNNNNNNN NNNNNNNNNN 
82 01 ATTCTTTGTT TTGTGTTCTT GACTCTTAAA CAGATCTCTA GCATATTGAA 
82 51 AATTCAACAT TTGATTTTCT AACTGTCATG GGCTTTACTT TTATTGACTA 
8301 TTGATGTGGC TGTTTATTGT CAGGTGAAAA TTTTTTAATA TGTTCACACA 
8 351 CTGATGATTG CATATTTGCA GCACACAGCA TCTTAAACCA CTCAGAGGTT 
84 01 TGTCACAAAA TGTGTGTTTC TTGCTCTGTA ATTTTTTTGT CATTTTGATG 
84 51 GCATATTTTA ATTATGCTTT TATTTCTCTC CTTCTACCAA GTGGAGACCT 
8501 CTGTAACAAA GATTTTTCAG GGTTGCATAT TTCATCTTTA AAAGCTGTGT 
8 551 ACACTGTGGA AACAATTGTT AACCAAAAGC ACTAGATAAT TTAGGATAAC 
8 601 TGAGTTCAAG TTCTGACTTT TGTACCTGAA TAGGTGTCTA ACATTTCTCA 
8 651 GCTACAGTTT TCTTATTACA CAAGCATATT TCCAAGGTAT GTTGAAGCTC 
8 7 01 T AAAC T G C AA CTGAAAACTT CCTTAATGTA GAAAAACTAT ATAGGATCTA 
87 51 AATATTGTAT TTTTGTATGG TTGTCCTTCT GTTTACTCGG AGATTTGACT 
8 801 GTATATGTGG CTTATGACAA TAGCATTTTT GTTAAAAGCA TTTTATAGAA 
8851 GTGTGAAGAA AAACTAAAAA TACAACCAGT TCCAAGGTTT AACAAAACTA 
8 901 TTCCGTTTCT GAGTTCTTTG GCTGTCATTG AGCAACTTGT GGTTTCTGAA 
8 951 GGAAATTATG TGAATTAGGA TGGTTTTGTA TCATTTATCC TTAAGAACAG 
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9001 GGAAAATTGA GATGTTTTCT TATGTTTCTG CTGGAGATTT TGGAAAGATG 
9051 TGAAACCTAC ACCTACAGAT TGACCTTGCT TAGTTAGCTC TGAACCTCCT 
9101 GCTGCCTCTT CCACGTAAAG TGAAAATTTT GGATTCTTAT CGGCTTCAGA 
9151 TAAACTTACA GGTTAGTGAA ACATAGGGAC TGAGATATAG TAATTCATTC 
9201 TGAAGCTGTT TTGGAGTGGT CAAATAATTT TAGTTGGATA AT AT AT ATT A 
9251 CTGGCTAATG ATTGTGGATA TTGGAAGTGA TGAAAAAATT ATTGAATTAT 
9301 TTCTTTCTGC ATTTCAAATG AAAAGGCTAT TAGTTTGAGC AGAGAATTTT 
9351 GATTTAGTAA ACAAAATATT TAAATTTCAT GTTTCATTTC TTTCTCCTAT 
94 01 CTGGGTTCAG ATACTCAGTC TTATAAATGG AACATGATTT ATTTTTGCTC 
9451 CCTAAACTGG TTATTAACTT CCTGTCCATA AT CACAAAAC TAT AT AG ATT 
9501 ATATATTTCT TTGATTATTT GGATTTTGAA TACTCTCTTT AAAATAATCA 
9551 AGAGAAAATT AGAGCTGTTA GAATGTTAGA ATTTGTTTTG AAGGCCACAC 
9601 ATAGTGTTCC CTCCACAGAG AGGTTATACT AGTAAATGCC TTTCTATTTG 
9651 AGGTCAACAA CTATGACAAC TTCCATTGAA CATGAGTTAG TATTTTAAAC 
97 01 GTAAAGCAAT TTTTATACCT GTATGCACCC AAAAAGTAAC AGGGGCTCTC 
97 51 AAAAGGGGTG GGGATTGTAC TGTTTACATG TATATTGAAG ATTGCTAGCA 
9801 GAATTCTGGG GCCAGCTTGG TGGAGCGGAG TACACTTCAT TGTCCTTAGT 
9851 GTAGTAGCTT CCTCCTCTAA TTTTGAGGTG AGAATGCAGA ATCTGTTTTT 
9901 TGTTTGTTTG CTTTTAGGTA GGAATAAAAG CAAGGCAAGG AATAATTTTG 
9951 ATTACTTGCA ACATTAAACT TGAATCCACA AATCCTTAGG AAGTGAAGTT 
10001 TTTGATTAAG ATTATTTAAC TGCCACTTTC CTTGAAAGGT TGTTTAAGAA 
10051 CATCATGTAC CTTTGGGTAA CTTCAAGTGG TCTTGGAATG CAGATTCCAA 
Q 10101 AGTAAGATCA GCGTTGAAGA ATCTTGACCT TTTCAAACAG GTAATTTGTT 
;S 10151 AGTATGTGTA GTCTTCAAAG TTAAGTTTCA GAGAAATTGC TTTGCTCTTT 
102 01 TATTCTTTTC CCCAATCAGA ACTGATCTTT ATGTAATACT ATTAAGATCT 
j": 10251 ACTAATTTCC TGAAATCCCT TCATAAGCTT AATCTGGCCA GGTCTTAACC 
'% 10301 TTTATGGATT AGAAATTTTA GTACTTCTTA AGCTAGAAGG CCAGGCCAAC 
; '4i 10351 TAAAGGGAAG CACTCTCCTC TCTGCCAGTT CAGCAACTAG ATCTGTCCTC 
u 10401 AAATACCTGC CACAGGGATA TGCTGCTTGA AGTTGCCCAC GCATGTACAC 
] m 10451 ACTGGGACCA AGAAGGCACT TCTGGTGCCA GAAACAACAC TGTGTTTGCT 
10501 TTGTGGAAAT TTTTGATATG CTTTAAAAAA TGTAGGTGTC TTCTCCCTCA 
%| 10551 CCTCCTGTGT GTAACTTCCA GCATTTCTTT TTGTTGCTTT TTATCTACAG 
«. 10601 AATTCATGTT CTTTGCATTT TGAGTTAGTT GAATCCACTG TATGCTTTCC 
isa , 10651 AGATGATAAT TAGTGAAGCT CAATGATTCT ATGCAGTGTC TTTCAGTGGG 
;™ 107 01 TAAGGGAAGA TATTCATCAC AGGGGTGGGC CTTCATCACA GGGGAGATGG 
M 10751 AAGGGCAAGG GAGAGGGATT CTGATGGCCT AACCCTCATT ATTGCCCCCC 
m 10801 ACTGCCACAT GCAGACACTC TTGCCGCTTT CCCATTGATT TTGTGCAATA 
%j 10 851 TTTTAAATTT TTCCACCAAT CTTTGGAAAA TATTAATTTA TTTTGTAGGA 
r i 10901 GAAAATGTAT TCATTTCCTT AAATATCCCT CCTCAGAGCA ACAAGCATGA 
7] 10951 TTAGTTTTTG GTATACTTTA AAAAATATTT TACCTACTTA TGGGTAAATT 
11001 GCAATAGATC TCTCCCCTAT CACCCACTTG TTTTGTACAA ATGAAAGCAT 
11051 ACTATATACA CTGCTTTGTA TGCTGCTGCT TTTTCTTAAC TGTACATGTT 
11101 GAAAGGTTTT ATTTATAGAC CTTCTTACNN NNNNNNNNNN NNNNNNNNNN 
11151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
114 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNC CTCCTTACTT TTTTATGCAC 
114 51 TCNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11551 AAAAAACTTT GAACATATGT TAAAGATGTT GTTTTGCATG TTCTTAGTGT 
11601 TTCTGCAAAA TAAATTTTTA TATGTGGTCA AATGGCAGAT ACATTTGTAA 
11651 TTTTTATCAA TTGCCCAATG ACTTCTTACT CAGTAGAGAA GATAGTTTAA 
117 01 GGGGAAAGAC TGAATGGCGA TGCTGACCCA AAGGCTCCAA CAATCCCATA 
117 51 TCCAGAGTCT GATCTTTCCT TGATTTTAGG ATTCTTTGAT CCCTTTTCTT 
11801 TCCCAAGAAA TCCCTCTGAC AACTCAGTGA ATGTCCATCT GCTCCCATCA 
11851 TCTGTTTCCC ATCAGACCAG AGGCAATGGG CTGAAGCTAG GAGAATGGAG 
11901 TGAGATGTGT CCATTTGCCA GGGTCTCCCA GTGGCTTCCT GCCTATCATT 
11951 TACTTGTAAG AAAGAGCCAC GCATTCCTTT AGGAATTGCT TAATTCATTG 

FIGURE 3D 



12001 ATCTTTTATT ATCTTTGTAG 
12 051 NNNNNNNNNN NNNNNNNNNN 
12101 NNNNNNNNNN NNNNNNNNNN 
12151 NNNNNNNNNN NNNNNNNNNN 
12201 NNTCAACTGA TGGAGTTGAT 
12251 CTTGNNNNNN NNNNNNNNNN 
12301 NNNNNNNNNN NNNNNNNNNN 
12351 NNNNNNNNNN NNNNNNNNNN 
12 4 01 CTTTGTGTAT AGTATGGGCT 
12 451 GGAGGAATAT GTAGAATTCT 
12501 TTATGATAAA AAGATTTTCA 
12551 ACTTTTTGTA GTGTCCTTAA 
12601 ATTCTAATCC TTATTCCAAT 
12 651 GTAAGAAATG CATGATTAGG 
12701 AGACAGCAGT GAAGGTAAGT 
12751 AGTTNNNNNN NNNNNNNNNN 
12801 NNNNNNNNNN NNNNNNNNNN 
12851 NNNNNNNNNN NNNNNNNNNN 
12 901 NNNNNNNNNN NNNNNNNNNN 

12 951 NNNNNNNNNN NNNNNNNNNN 
13001 NNNNNNNNNN NNNNNNNNNN 
13051 NNNNNNNNNN NNNNNNNNNN 
13101 TAGATATGGA GGACATATCT 
13151 AATTGAAGAG GTTTCATTTT 
13201 TAATGTAAAG GAGAGTAAAT 
13251 TGTTCAGTCA TCTTCTATTT 
13301 AATTAAATTG ATTTCTTTAT 
13351 GAAAGTATAT AAAGTGGCTT 
13401 TAGGGGCTAC ATGGAGGAAA 
13451 GGTTTTTCAA ATGCAGATAT 
13501 AATTTGGCTA CTAGCTTCTG 
13551 TTTAATTTGT GTTGTTTGTC 
13601 AATTATGGTT TTACTGCATG 
13651 TTTACGGAGT GGGAGTGAAT 
137 01 CTATCATGAA GCCAGTGGTG 
13751 GTCAACATAA GATGAGGCCC 
13801 GTTCTATACA TAGTTTAAGA 
13851 GCTAGATACA GCCCAGAATA 
13901 TTTGGATGGA CAGCTCTGGA 

13 951 TAACAATGTT CTGAAGTTTG 

14 001 CATTTTGTCT TTCTTTTTTA 
14 051 CTAATGGAGT TGATATCCAG 
14101 CCGCCTAGGC TCCAGTAAAA 
14151 TTTTTAATAT CAATTTTGTA 
14 201 CCATTTTGCT ATATACATAT 
142 51 AGGAGGAAAG G CAT G AAC AT 
14301 CTGATACATG GATATGGGCC 
14351 CAGTGCTTAT TTCCCCAAAG 
14401 CAAAGTAACC TCAACTGTTT 
14 4 51 TTTATATAAG TTGCTTAGAC 
14501 AATACTCTGC ATATCAACAT 
14551 TGGGGTGTGC GGAAGGGCTG 
14 601 TAGAACCTGA AATCCTGATG 
14 651 AAAAAAAGCA AATAGTGATT 
14 701 GAGTGCAGCA ATCCAGTAAG 
14 751 AGGTCTTCAA ACGCCTTTTC 
14 801 CTGTCTGAAT TGTCACCGTA 
14 851 GCATTGCCAG GGCCGAGGTT 
14 901 AGTCCTCAAA GATAGTAAAA 
14 951 TTCAAAGTGA TGTCAATAAC 



TTTAGCCATA ANNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
ATCCAGAGAC AAGATACAGA GATTCTGTAT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NT TAG CC AT A ATTTTTATAC 
TTTTTTTTTT CTAGACTTAT AACCTATTAA 
GAGATCTGAA TACCATTTTG GGGACACCGT 
GGAAATATTA AGACATTTTT GTACCAAAAT 
AACCATAGGG GCATATATTT TCTAATAGTT 
TATATACTTA AATTGTCTCC TTTTAGAAAA 
TGAAAAATTA AAAGACACAT ACTAAGAGCA 
ATTTCTAACT TTCCTGTTTT CCTGACACCT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNCA TCATCACTGT TGTATACCTT 
AAAGACATTT GATTTCTGTA TTAGTATAGT 
ATTGCTACAA ATTTTCTTAT GTTTAATCTG 
CATGGCTAAT ATTACAGAGT AGAAACCTTC 
TTGATTCCTT AGAAGAT AC C TTAAAAGATA 
TTCTATTTGT TTATGCCCTA TCTTGTTTGG 
CCAGGAATTC ACACAATTAA AATAAGGAAA 
ATGGGGGTAA AATAAAAATA TTAGGAGGAG 
GTAAGCCATA GGATCCTACA TATGTCCTAC 
GTAGCCAAAG GTAAAAGATG TTATTGCTGA 
AG G G AAAAAG CAAACAAGTT ATGTGGGAGG 
TAGTTCTGAA CGAAGACTTT CACGTGGCCC 
GATGGGATGG AAGGGGCCAC AGCAACATTC 
GTTTTTGAAA AGCTGTTTCA TAGAAGATCC 
TTACTCTAAA AGTACAGTTC ACGGAAAGCT 
AGCATTATGT GCATAGCTTT CTGACGTTCA 
TCTAGCTGTG ATGGATGAAC TGTATAGGGT 
TTAGACAAGA TAGTTTCAGG TTAGAATCAC 
CTGTATTATT TAACAGATTA AAGGCCAGTT 
AGTCGATATA TTTTGAAGAT CAGTAATCAA 
AGACATTGTG AATAGAGCAG GGATAGGGCC 
GGAGGATCTG CAAAGAAATT AG T G G GAG AT 
TCCATTGGAT AGCAATATAT AAATTCACCA 
TCATTTCCAT GCCAGTTATG GCTTGGAATA 
TGTTCCCAGG CAT CAT TT AC CCTACTGATG 
AGTGAGCCCA GGAACAACAG AATCCCTGAC 
TTCCTGATTG CTATAGGTGT TCCGGGGAGT 
TCTTAATTCA CCTGGATTTA TAAGGGTTGA 
CTGAACAGAC TCAAAGCAGA GTCTGTAGGA 
CCCGTACCCA AACCTAAGTC ATCTTTTCAC 
ATTCTCATGT ACTCTTGAAG CCCTAGGCAA 
CACATATACC CTAGGATAAT TTCTCTCTCA 
TTAAAAATTA TGACTCAATG CATATTCTGA 
TGTCACATCT CCTTGGGAGG AACAAAGAGA 
TGCAGAGACT AGCGTGAACC AAGAATCTCC 
TATCAACGTG GCAGCGAGCT GAGAAAGTTA 
TTCTGTCTTC ACGGTAACAA ACAAATGATC 
TGTAGCCAAG ATAATTTGGC TTGAAAAAAA 
CAGCTACAAC CACAGAAAAA TACT AG TAG C 



FIGURE 3E 



15001 AT ATT GAT GG TGCATCCCTG 
15051 ATACACCAGT TATTTAAAAA 
15101 AGTAATATCA AATACAAATA 
15151 TTTTAAGTTT TCTGATTTTA 
15201 GAATTATGGC CATAATAGAA 
15251 TGAAGGAGTA TATTACAGGC 
15301 AAAATAGTGA CATCTATTTT 
15351 GAGAATTTCA GTTTAAACAG 
154 01 ATTCTTAACT CTGATGGGAG 
154 51 TTAGCTTTAG AATATAGTTG 
15501 ATTATTTTTT ATTAGGAGAC 
15551 GATACTAGAT GCCAGGGGAG 
15 601 GTTATTTGCA TGTTGAGAGT 
15 651 CTTCATTTGA TCAATATCTT 
15701 TTCTCCAGAG CTGACATTAC 

157 51 AGGGTTGCCT TTTGTAGATG 

158 01 ATGTCATCAC AGTTTAAATT 
15851 CTAGGAATTT TGGATTTATG 
15901 AAAAACCTTT AGCCNNNNNN 
15 951 NNNNNNNNNN NNNNNNNNNN 
16001 NNNNNNNNNN NNNNNNNNNN 

ft* 16051 NNNNNNNNNN NNNNNNNNNN 
'[% 16101 NNNNNNNNNN NNNNNNNNNN 
'" s \l 16151 NNNNNNNNNN NNNNNNNNNN 
hSI 16201 NNNNNNCTAA CCTTATTTTC 
Si 16251 AGGAAAGCCC CTAAGTGGTC 
r?| 16301 AACTCCATGG ATGGACTTTC 
J"; 16351 CCAAAGTGTT TAGTGCCTCC 
\* 164 01 TACTTGGTAA TTATCAGCAG 
CT! 16451 ATTGATAATT AATAGAAAAA 
%| 16501 TTTAGTGTAT TTCAGTGTTT 
16551 GTGGTAATAG TTTCTACTGG 
f\ 16601 AAGTATTTTA TGAAGTTTGA 
2 16651 TCTCTTGGGT TGAGTTTTTA 
U 16701 GGTGTCTATG TATGGCCAAG 
H 16751 TTCTAACTAA GGCAAAAATG 
%\ 16801 AGTATAAAAT TATGAAGAAA 
.^j 168 51 TTAAATCTTG TGTTGAGGTC 
U? 16901 TAAAGTAGTG TACTGAAAGT 
H 16951 TGTCCATAGG GTCTGTTGTC 
17 001 TAGGTAGTTT TCTGTGAAGA 
17 051 ATGAAAAATA TAGCATGAAA 
17101 AGCTATTGCT TTTTCTAGCT 
17151 TGTGTCAGGG AGGACCTTTC 
17201 CTTTAACAAA GTGTGTGTGT 
17251 TGCCTCCAGT CTGAGTCACA 
17 301 GCATTTTATT ATTTCTTTTC 
17 351 TTTTATAATT CTTTACAGTA 
17 4 01 CCTTTGGATT AAAAAAAAAA 
17 4 51 TGATTTTCCT AACACGGTCC 
17 501 CCTGTGTTTA GCAAGCAAGA 
17551 GGTTTATGAG TCACAATTTC 
17 601 ACCACGTATA GTAAAGAACC 
17 651 CAAGTTGCCG GAGAGAGTTC 
177 01 TGAAATGTTG CTTGTATTTG 
17751 GCAACAGATT ATCAAGAGAA 
17 801 ATGAATGAAT CCTATTTTAG 
17851 AAAAAAAAAT TGTATGTAAG 
17 901 TGTTGTGATG GTTGCATATA 
17 951 TTCCCAGAAA CTTCCTAGAA 



GGAACATGCC AGGGCAAATT GGGGTGCAAA 
TTGGTATGTT ATAAGGTAAA GAATTCATTA 
AACTTTTTAT TTTGATTCAC TAAAACTTCT 
TTACTTAATT ACTACTGATA CATAATTAAA 
TTCCTAATAA AATTTCTAAT AAAGCCAGAA 
CAGACATGAT AAAGCATTAT GATGTGTGGT 
TTCATTCTGT ATTATTTTAT AAATTTTCTG 
CCTGCTGAAA TACTGTTAAA TCAACCTGTT 
GAAACAGATA TGAATAATAA AAATGATTTC 
TTGCTTTGGA GAACAAACCG TTTTATTCCA 
TTCATTTTCT GCCATACATT AGCTTTGGTA 
TGCAAATTTG AGAATAAGGA TTTGGCATGG 
CAAGATTTAA CTAAATTTTA AAACCAAACA 
TTTAACCATT CTGTAGATTA ATAATATAAA 
TTTGCTATAA CATCATCAGA TCACAAGATT 
TTATTCACTT ACTCTGAAAT GTTAGGAAAT 
TGTAGTAATA T AT AC AAAAG GAAAACACAA 
CTTACTTTGC CAAAAACCAT GTTGATTTTC 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
GTAAATCAAA GTTCCTAAAT TCAGTTTCTA 
AAAGGTGTGT GTTGCTGAAT TTCAGAAACA 
CAGACATAAA CTTCCCACCA AAATATTGTT 
TATTTTATTT TAGTGTTAAG ATTTTGTAGG 
AATATTTACA TACCAACAAA TAGCAAAGCC 
CAAAATCATA GCTTATGTTT AGTTGTATTT 
AGAGTTGATT TTATATTGCA TATACTAACT 
ATAGATTATT TCAAGTTCAC AT GAT TAT AG 
CATAGATAAT GGTGGTTTCA GGGTCCCTTG 
CATCATTGAA GAATTAAAAG ATTGTGTCTT 
AATTAAGACA TTCTGATGTA CAGTTTCATT 
ATTTTTGAAA CAATTACTTT T TAG GAT AAA 
TAATGTGTAG GTTTGCTAAA TGTTGTGTTC 
TGATGAATTT TTTTATATCT TTAATAGTTC 
TAGGATCATC CATGTTGCTA CTTAACTTGT 
CTGGATGTTG GTGTTATAAA TGTACAGAGT 
GTTTGCAGGC TAATATCACT GTTTTTGACA 
ATTAAGGTTG GGGTATGGAG AACTTTCTAG 
GATGATTAAG GTTGAGAGGC AGAAGTACAT 
CCTTTTATGC ATGCACCTGG CATTTATCAC 
CATGCTGTGC TGCTTGCTTA AGGGGCTGTA 
TGGTAACTGC ATCAGCAGTC TTAGCCTGTA 
AAAGTTTACA CTTGGCCTAA TACT TAG AC A 
AATGAAACTA TACATGAGAA GATGGGCATA 
AAGGCTCAGT TCTTTAAAGT TTCTTATCCT 
AAAGTTCAGG ACTGGCTCCA AACCCATAAA 
AACATAATCC CCAGATAGTT TAGTTTCTCT 
ATAAAATCAT CAGAGTGCTT ATTAATTCCA 
TTCAGATGAA CTGAAGCAAG GGTTCTTAGG 
TAGAATTCTA AACTACCTGA GTAGCTTTGC 
CTACTGTGGC CATTTTATGA TGGCCATAGA 
AATGAGACAG ATTTTTCTGT ATTATGTGCT 
TGAATGTTTT AATGGGGTTT ATACCGCAAA 
GCATTTATTT CTGGCAACTT TCATAAAAAT 
AAAATTTCCT TATCCTTCAA TAGAGGATAG 
GTAATCTATT CCAGATTTAA CATTGCTTTG 



FIGURE 3F 



13 001 ACATAAAATG CAGTTTTGTG 
18051 GACA7TGGCT AAAATTTTAT 
18101 AACTTACTAG ATCTATGTAT 
18151 ATATGACCCA ACCTTAAGAT 
18201 TGTTTGGCTA TTTGGTACTA 
18251 CTATTCTTTA TTAAGTTGTT 
18301 TTTCCTGATA AATTTGATGA 
18351 AAACGGTATC CTTTTATATT 
184 01 ACTTGTGTGA TCCAGAATGT 
18451 TTTGCTTAGA TTGTACCTGA 
18501 TCAACATCAA GATATGGTGA 
18551 CCAATATGAA AAAAAAAAGT 
18 601 AAATACATAG CAATCATTCT 
18651 AATCAAATTG GACTTAGAAT 
18701 CCCCATTCAC CNNNNNNNNN 
18751 TTGTAAATTA CAGTGGATAT 
18801 GAAGAATAAA ATTATCCCTT 
18851 AACACTGACC ATTATTAGTA 
18 901 CAAACTGGGT TATTTATGTA 
18 951 TAATTCTTGT TGCTATTGAC 
19001 TGTGGAAGGG AAGAAAAAAA 
19051 AGGGAACATA CAGTATGTAA 
Q 19101 AAAACTATTA GCCTTTCTTA 
:|| 19151 ATGTGTTTTC CAACTCCAAT 
.j\ 19201 AGAAATTAAT ATTCTGGAAA 
19251 CAAAAGGTCT GAAATATTTA 
"'I 19301 ATAACTTCAA TTATTGGTTA 
Hi 19351 CAAAATGGTG ACTAAAATAT 
U 194 01 TTGTTTCAGA AAAGCACTGA 
19451 TAACAAGTTG ATAAATACCC 
!f[ 19501 GGAATTTCTT TTTTTGGACA 
H 4 19551 AGAGAATAGC CAGGATAAGG 
$ 19601 CTTGAAATAA TTTTGGCTAA 
U 19651 AATTTTACAA AGCTGTAAAC 
P| 19701 TTTTTAGGAA AACTATATTT 
j'"f 197 51 ATAGTAAGTT AATTTTAATT 
| ?B * 19801 TTGATTATTC CTTTCTTCTG 
%| 19851 CTAAGCTAAA GGACAGTTTA 
f;;^ 19901 AAACAAAGTC CACCAAATGA 
19951 AGGGAGTTGT ATTTCATATT 
' : '* 20001 GAAATATTTT TATTGTATGA 
20051 GGCATTTATG CTCTTGTTTT 
20101 TTTTTGGCAG TAAATAATGT 
20151 AACGTTGGTG AAATATGTGG 
20201 GAAGACAATG NNNNNNNNNN 
20251 NNNNNNNNNN NNNNNNNNNN 
20301 NNNNNNNNNN NNNNNNNNNN 
20351 NNNNNNNNNN NNNNNNNNNN 
20401 NNNNNNNNNN NNNNNNNNNN 
204 51 NNNNNNNNNN NNNNNNNNNN 
20501 NNNNNNNNNN NNGAGAATGT 
20551 GTCATGAAAA ATGTTTTTAA 
20601 AACAAAATTG ATAGTACTAG 
20651 GGAGACTTAA AG CAT AT GT A 
20701 CTAGTAGAGC TATGGCTTAC 
20751 AATGTGTGGT TGGCCACCAA 
20801 NNNNNNNNNN NNNNNNNNNN 
2 08 51 NNNNNNNNNN NNCTAACTTT 
2 0 901 CTTCCACAGT TCTTGACAGA 
20 951 ATACAATACA AATGGAGAAC 



TAGTTTTAAA ATGCAAATTA AAAAATATAG 
CTTGAAGTCG GGTATATATT GATACCATAA 
TTCAAGGCTA ATTTATGCCA AG T AG G AAAA 
ATTACAAGGA TAAAATAGAC TATACAAAAC 
ATACAACTAG TTAGAACATA ATGATGTTTG 
TTACCTTTGC TTACAATAAT TTAAAGTATT 
CTCAAAATTG GCAATTAAAG AATATTAAAG 
TTTTCTGTCT CATATATAAC CATAGTCATA 
AATTTGCTAT TTACACTTTG ATTTCAGCTG 
TGTATTTTAT TATTCTATTT AAGGAATGTG 
GTTCTTTTCA AAATAACAGT AGAAACCTGA 
CTAGCAAGCA AATGTAATTT GTGTCTTTAA 
GGATCAATAG TTAAATTATT GCTTCAATTA 
TTTTTTCTTC TTATATTACC CAAAGGAAGG 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNAA 
TTAATCCTTT AAAGGCATTA ATTTAGCAGA 
TTCTTACCTC TAAAATCTCT AGGTTGATCA 
CCATTTCAAG TTTCTTCCCT TTACTTTCAT 
TGTTCAAGTG AAATGGCTGT GCTTTCATGG 
AACCAAAGCA GCCATGCAAG AAGAAAATGC 
TTATATTTCC TCCCCAAAGT TGGAGAGAGA 
GAAACAGGTA AGTAAAAATA TATCAATTTT 
CTAATTGATC AAAAGTTTAA ATTTTTTCAG 
TTAAATAAAA GGATACTGTC TCTAGGAATC 
TGAAAAATTG GATTTGAAAA TACAGCATCA 
AATTTAGAAT TTGGACTATA GAAGAATAGC 
ATATTTTTGT TATGAAATGT TTTTTTATAA 
TACTATTTTA AACATGTCCT AGATTTTTTT 
AAGTTGAATA TGTGTAAGTC TCCGGGAATG 
AAGTCCAGCC TCCTTACAGA GAAAAGATCT 
TTTGTTTTGC AGCTTTCTAC GTCTTAAGGC 
TGCCAGCTCA AAGGTTAGAT GAACATATTT 
TCTATGTCTT GAAAGGCATA CCTTCTAAAT 
AAAACATTAG TTGTGTTTTT GAATTGCTTC 
CTTAGAGATG TGCTTATTCT ATACATAATT 
CCATTATTAC ATTAAACTTT TCCGTTTCAG 
TTTTATGAAT CTAATAAATT GACCTCCATT 
TGTTTTTAAT ACTTAAGGAA GCAAATATAA 
AATTACTAAC AC C AG C C TAT CATTAACCTG 
CATTAATTCT CCATGGAACA ATTCAGGTAA 
AATATGTTAT CAGTTATAAT AGCACTTACG 
AGGTACATAA AGGGTTCCTT GTTTAAGATA 
CACTTATCAC CCTATGACAC TTTAGTGGAA 
AATTGTAATG TTTAATGGAA TTGATTCTAA 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TCTTTAAACT GATTGAAGAA ACTATCACAT 
GGCAGNNNNN NNNNNNNNNN NNNNNNNNGG 
AAATAGAAGT TCAGAAATTT CTTTGTTCAG 
TTTTAAAGTT ACTAGCAGTT ATAGGATTGA 
ATTAGGGAGG CAATGCCAAG GAAAGAATAG 
CATNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
GCAAGATTCT TACTGGTTAT AAGGAGGTGG 
TGTTTGCTAT TATTAGTGTG TTCTCAATAT 
TCTGAGCCGA GTTTTACTAG GATCCATCCT 



FIGURE 3G 



21001 AG G AC AG AT G GTACAGCACA 
21051 CTTTTGGAAA TAATCAATTG 
21101 CTAAAATCAA AACTTTCATG 
21151 TCCCATTTCT CTAATAATAT 
212 01 TGGTTGTCTA ACAGTGTTTG 
212 51 ATTGAATTGT TGAATGTGAG 
21301 NNNNNNNNNN NNNNNNNNNN 
21351 NNNNNNNNNN NNNNNNNNNN 
214 01 TAAGATGATT GGCTCTTATT 
214 51 AGGGATAAAG ATTTGAGGGA 
21501 AGGATCAAGG AAGACTTCAT 
21551 ATATATATAT TATCAGTATA 
21601 ACAGAGGAAG GCAATACTGG 
21651 T GAG AG CAT A TCTTAAGGGG 
217 01 TGATGGCCCA GCTGAATCCC 
217 51 CTGGGCACAC AGAAAAATGC 
21801 ACATTCCGAG AACTGTTAAG 
21851 TTTGAAGGGC ATCTGATGAG 
21901 CTGGAAGGCT TTTGATATCA 
21951 TGTAGAGGAA CACTGTAAAG 
22001 GATAGCTGGT GGTAATGGAT 
22051 GCTTGGAGTG TTTGTAGTTG 
Q 22101 GGTTGGTTCA AAAATCGGTA 
M 22151 TGGGGCCTCT TAAGTAGCTG 
.^5 222 01 CCCTGTGGAG CTCACTTCCT 
22251 TCAACCTGGT GAGAAATTAG 
%f 22301 AAGAGAAGGA GAGGGATTGA 
Ly 22351 CACAGTGTTA GGGCAGTGGC 
22 4 01 TGNNNNNNNN NNNNNNNNNN 

22 4 51 NNNNNNNNNN NNNNNNNNNN 
: f[ 22501 NNNNNNNNNN NNNNNNNNNN 

22551 NNNNNNNNNN NNNNNNNNNN 
es 22 601 GTTGTGAAAG GTGAAGAAGA 
U 22651 A CATC ATT TG GTGAGCTAGA 
227 01 CAATTTTGGA TATGTTCAGT 
22751 AAACTTGATA TGTGCTTGGA 
22801 CTATGATTTC ATTTCTTTCC 
SI 22851 CTTCGTCAAC TTCAGTCAAC 
n 22 901 GGAACAGCTT TAAACAGTAG 
^ 22951 GTTCCACATC TGCTAAATGT 
1 23001 ATCTTTCCAG TTTACCGTTT 
23051 AACATTTGTT TATATCAAGG 
23101 TTTTCTTTTT TTCCCTCTTT 
23151 CACTCCTAAC TCTAAGAAAA 
23201 GTGAATTCGT ACNNNNNNNN 
23251 NNNNNNNNNN NNNNNNNNNN 
23301 NNNNNNNNNN NNNNNNNNNN 

23 351 NNNNNNNNNN NNNNNNNNNN 
234 01 NNNNNNNNNN NNNNNNNNNN 
234 51 NNNNNNNNNN NNNNNNNNNN 
23501 NNNNNNNNNN NNNAGCACAT 
23551 ACAATGTATT AAAAATGAGC 
23601 TTCTCCAGTC TGCTTTCCTG 
23651 CTCTACATAT AACACTTGCA 
237 01 GGTTATTTAT GGATCTATAT 
23751 AAGGTGGAGG CAGTGTCTTA 
23801 CAATGCCTCG TCCAGTTCAG 
23851 GTCATTTCTA TTTATTAATA 
23 901 GT AAT CT T AA TAAACATATT 
23 951 ATGAATATAT TTGTCTCCCT 



TTTTCAAATG GTTTTGTTTA C CAT AAT CAT 
GAAGTAGAAC AATTCTTCAG TTATTCATTT 
GGATAGTTTT CTGTGCTTCT GAACCGTTTT 
CCATCCATAA ACCTCCGAGT GGAACAGTTT 
TGCTTGGGAA ATGAATTGAA GGGAGATGGA 
CATTTAGAGT TCCTGTAATT TATCCGTNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNTTTG 
CCTTCTATGA TTCTTAAATA GGAAAAAGTT 
GTTCAGGGAA AGAATCATTT CCTGCTGAAG 
GAACTCAATG GTATTCAGTT GATACTATAT 
TCTAGTATAT AGTATCAGTA TTGAAAGCCC 
GTAAGAGGCA TGTTTTAGTA TAGGATTCTG 
CAGTAGCCCA GACCAGCAGT GATGGAGTGG 
GAGGGATAAA TCAGTCAAGT GAAGAGTATC 
ATTTAAAGCT CAGTAGCCAG AG G GAG GAG C 
AATCTCAACC CAGCATATGT GGGGTATATG 
AGACGCAGTG GGAGAAGTGG GAAAGGACTC 
TATTAAATAC ATTTGCCTTA ATCACAAAGT 
ATTTTAAGCA GAGAAATTAA TATTTTAGAG 
CAGATCTAGC ACTCAGGTGA ATTTAGTTTT 
TCAGTGTCTC ACAGGAGACA CGGATATAAG 
GGTCTGCATT GCTGCGAGGA GTCTATCAGC 
GCCCATTTGG AAGAAGTATG TGCCTCATTT 
TTAGTGCAAA TGCCTGGGAC CTGTAGGAAG 
GCTTCATTTT AAGTAGTGGT GGTGGGAACA 
AAGATACTGA AAAGAAAGGA TGGAGAGGCA 
GGGGGGGCAC ATGGGCTTGG CAGACAGACT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNCAGTATT TGTAATGCTG 
TGTAATCAAG ATTACTCCCA AGTCTCTAGG 
GAACACAGGA GAAGGTGCAG AT AAT G ATT T 
TTGAAGCGCT TGTGAGACTT TGAAGTAGAG 
TTGCAGGGTG AGATCTGAAC AGAACACAGG 
ACTATTCTAC TCAAACCTTC CTTCCTTTTC 
ATTGATTATT GCCACAACTA TCTCCACGGA 
AAAAAGAATG TGGGATAGGT AAAAATCTGT 
TAT CT AAT AG TTGTAAGTTC TTGAAAAAAA 
TGGCAAAGCA TTGTAAGCAA CTAATAATTC 
GCTGAAAAAA AGTCATCCTC TACAAACATA 
TGGTTGCCTT TTTTTGTTTT ACATCACCAC 
TATTTGACTA AAAGTGAATC ATTGTTAGTA 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TTATTTTTTA AGTTTATATT CTGTCCTAAT 
CCTGCCTCAA GTCCTAGCCT AAATGCCACT 
ATTTTCACAG CAGGAAGTCT TCTGTACTTC 
TGAAGCATAT CTCACATTGC CCTACATTGT 
TATTTGTTCT GCTAGACTAT TAGTTTCATG 
TACATCTTTA CGCCAGTATA ACAGCTTCTG 
TTCAGTACAC TTTTGAAGGT GTAAGAGCTA 
AAAGGGATTT AAAAAAAAGA TGGACAAGAT 
GTGGTTTTCA GGAAACAGTT T AAT GAG AAA 
ACCTTATGAC CTTATTACAT ATCATAGATC 



FIGURE 3H 



24001 TTCACTCTTG CTTTAGATTC 
2 4 051 CACATAAAGG TAATCTGATT 
2 4101 GTTCACAATT ATGGGTAGAA 
2 4151 GCTGACATTG CCTTATCAGT 
2 4 201 GCAGTGGACG GCCCTTGGGT 
24251 GCTTTTCTTA TGCTGCTCAG 
24 301 TGGTGAATTT CTACTTATCC 
24 351 AGTTTTTCCT GGTATCTTTC 
24 4 01 TCTTCTGTAT TCTTTTAACA 
24 4 51 AGGAGACTGT ACACTACAGA 
24 501 TCTAGTGAAG GAGCTAGATA 
2 4 551 AT AT AT AT AT TTAAAAGTAA 
24 601 GTATGCACAG AGGCTTATGA 
24 651 AGGCGTGAAA ATCTTGGGGG 
24 7 01 CTGTCTAGAT ATTAGTGTAG 
247 51 AAAGTGTGGA GGTACTCGAG 
24 8 01 TATATTTGTG AATGTGATTG 
2 4851 ATATCATGAA ATATGGTACT 

24 901 AATGATCTAC ATGAACATGA 
2 4 951 GAATGGCATG CTGAAATCAT 
25001 AGGCCTGGAT TGAAGTCCAT 
25051 NNNNNNNNNN NNNNNNNNNN 
25101 NNNNNNNNNA AGGTTCTTCA 
25151 TTTAGAAAAC AAATAAGCAA 
252 01 ATCTCTGACA TCATATTAGT 
25251 NNNNNNNNNN NNNNNTATAC 
25301 CATAGAGATT TTGGTTGCTA 
25351 TCTGGCCTTT TGTGATGATT 
25401 CAGATGTTTT AAAAAATGAT 
25451 ATTTCATCAT ATTCTATGAA 
25501 TTCAGATATC TATTATATTT 
25551 ATCTTTTCCA AACCTTTTTG 

25 601 CCTGTTCTGT GTCCTTAACA 

25 651 TTCTGTTTCT CCTTACTAGA 
257 01 CCTATTCATT TTTATATCCC 
25751 GGCGTTTCCT CCTTGTCATA 
25801 NNNNNNNNNN NNNNNNNNNN 
25851 NNNNNNNNNN NNNNNNNNNN 
2 5 901 NNNNNNNNNN NNNNNNNNNN 
25951 NNNNNNNNNN NNNNNNNNNN 
2 6001 NNNNNNNNNN NNNNNNNNNN 
2 6051 ACATTGAAAA GTTAACAACT 
2 6101 ATTCATGTTA TTGTACTCCT 
2 6151 NNNNNNNNNN NNNNNNNNNN 
26201 NNNNNNNNNN NNNNNNNNNN 
2 6251 NNNNNNNNNN NNNNNNNNNN 
2 6301 NNNNNNNNNN NNNNNNNNNN 
2 6351 NNNNNNNNNN NNNNNNNNNN 
2 6401 NNNNNNNNNN NNNNNNNNNN 

26 4 51 NNNNNNNNNN NNNNNNNNNN 
2 6501 NNNNNNNNNN NNNNNNNNNN 
2 6551 NNNNNNNNNN NNNNNNNNNN 
2 6 601 NNNNNNNNNN NNNNNNNNNN 
2 6651 NNNNNNNNNN NNNNNNNNNN 
2 6701 NNNNNNNNNN NNNNNNNNNN 
2 6751 NNNNNNNNNN NNNNNNNNNN 
2 6801 NNNNNNNNNN NNNNNNNNNN 
2 6851 NNNNNNNNNN NNNNNNNNNN 
2 6901 NNNNNNNNNN NNNNNNNNNN 
26951 NNNNNNNNNN NNNNNNNNNN 



TAAACAGAAA TCAATTTATG TGCTTGAAAT 
GTGTCATCTA CTGCTTACAG TCCATTGGCT 
TCTTGATTAT TTGAGTAGAA GCCTTTTCAG 
GGAGGACTGT TTCTTGTTGT AGGCCCAGTC 
TTCCAGACTT CTGCTGCTTC ATGCCTGTTG 
TCTGAGTCAA AGGCCCTTCA CCCATTCCCA 
TTTAAAATTC AGCTCACATT TCCTCTTGAA 
CTATTCCTGC CTCCCTGCAG AGAAATGCCT 
CCTTGTACAC CTGTCATGTC TATCTTTGTC 
GCAAGTACTG TGTATTTAAA GAGCTCAAAG 
AGAAAAACAA CTCTGAAAAT ATGGTATAAG 
ACCTGTATAT ACATATAATT TATGATAGAA 
AAGCACTGAA GAGGGTATGA ACTCAGCATG 
TGATATAATG GTCAAACTGA CAGGTGAAGC 
CAGACAAGAG GGGGAATGGC ATTCCATGTG 
AGTGTATTTC TAAGAACTGC AATTAACTGC 
GAAATGAGGC CAATGGGATA GGCAAGGGTC 
GAATGCCATG CTGAGTGTTT GAAATAGTAT 
TGAGGAGCCA GTGAATGAGT AT AA G GAG C A 
GGGTTAGGGA GGTCCCTCTA ACAATAGTTC 
GATGTTGGAC ACAGAAAGAC TGTNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
ACAGATAAAT TTGGAATAAT TATCATTTTC 
ACTAAAAACA CCTGAAGGTC AAAATGACAA 
CGAGGCCCAG TAACAAAATT GNNNNNNNNN 
TTCAACTCAT GTATCAGTGA CTCTTTTAAA 
AAAAAGGTGA TGATGGTTAC ATTAAAATTT 
CTTTAGCAAA CCTGCTCAAA ATTACAGTAT 
AATTTTCAAC TGAAAGAAAT ACTGTAATGT 
TTCTTACATT ATGAAAGATA TCTTTTATGT 
CCTTTTTGAA AGAATTTTTT CTAGCACTAC 
ACTTTCCTAA GCCTGTGTAA TTAATCACTG 
TGCATCTGTT ATCAGGACAT CTCTGCTTAC 
CTTACCTGCT GGAGGATGAG AACTTGCTCT 
CAATTTTTAA TATAGTGTCT TAGACACTGG 
CTTATAATCC TCNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNCCTT TCTCTAAATT 
ACTCACAAGT TAAATAATAG TTGTCTGTGT 
TTTTATATAT TTCTTCCATG NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 



FIGURE 31 



27 001 NNNNNNNNNN NNNNNNNNNN 
27 051 NNNNNNNNNN NNNNNNNNNN 
27101 NNNNNNNNNN NNNNNNNNNN 
27151 NNNNNNNNNN NNNNNNNNNN 
27201 NNNNNNNNNN NNNNNNNNNN 
27251 NNNNNNNNNN NNNNNNNNNN 
27 301 NNNNNNNNNN NNNNNNNNNN 
27351 NNNNNNNNNN NNNNNNNNNN 
27 401 NNNNNNNNNN TTCACATTTC 
27 4 51 CTTAACGTTT TATTTGGCAT 
27 501 ACAGAAAGAG AAAGTTAAAA 
27 551 TCAGAGGATT TTTTTTTAAG 
27 601 TAAAAGCTAA CTTGAGATGC 
27 651 GGGGGACACT TCAGGAATTA 
277 01 CTTGGTAAAT GTGTCTGAAT 
2 7751 GTTATTGACA GCTCTTGGGA 
27801 TATTTTAACT CTATGTTCTA 
2 7851 TGACTATGGT TTGTGGTTAA 
2 7 901 GTGAAGCCCA AAGCAAACAA 

27 951 GGTAATAATA ATATGCTGCA 
2 8001 AACAAGCAAC AGGGTAATGA 
28051 GCTACTTTAG TTTTCTTAAG 
28101 CGCTCCTTTT AAGAAGTATT 
28151 TATGTTTTTA AT CAT TAT T A 
28201 TTAGTATTTT GTTATAATCT 
28251 ATGCTGGACT TTCTGGTCTT 
28301 AGCATGGAAC TGGGATAAGA 
28351 NNNNNNNNNN NNNNNNNNNN 
2 8401 NNNNNNNNNN NNNNNNNNNN 
28451 NNNNNNNNNN TGAGTTCTGT 
28501 TGATAACAAT CAGAGGCATT 
2 8551 AGATTTGACC ATCAGATAAT 

28 601 TGGAAAGATG CTTTCCCCTG 
28651 TTTGTTCTGC TCATGATATG 
287 01 CTGTACAATC TCCCTCCATT 
287 51 TTGTTTCATA TTGCCTTCAG 
28 801 TTTGAAAAAG TCATTCCCTG 
28851 TGTCCATATC TGAATTCTGT 
28901 AAAGTCTTCC AGTTTCATAT 

28 951 TTTCTCTGTC CCCTTTTCTT 
29001 TTGTTCTGCT TTTAGTGGTG 
2 9051 GAAAAAATTA CAAATGATTC 
29101 TGAGAAAATC CTTCTCAAGT 
29151 TTGCCTTATT ACTTTTCAGA 
2 9201 CCTTAGTTTT CAAATGTTGG 
2 9251 AAGAAACATT ATAGGACTGA 
29301 TGATAATTCA CTGCTTATGT 
29351 AGACCCAGTT ACATCACTTA 
2 94 01 NNNNNNNNNN NNNNNNNNNN 
2 9451 NNNNNNNNNN NNNNNNNNNN 
29501 NNNNNNNNNN NNNNNNNNNN 
2 9551 NNNNNNNNNN NNNNNNNNNN 

29 601 GGATTTTTAA G AC T AAAATT 
2 9651 ACATGTGACA TTAAAGAGCT 
2 9701 TTTTTCTAGT TAAAACAATG 
2 97 51 GATTATGCTA CAGGAATGCT 
2 9801 TTGGACTTAA TTCTGGCAAC 
29851 TGATTATCTG TGATTTTTAC 
2 9901 TAAACCATAT TTGGATGCTA 
2 9 951 CTTCTTGTTC TGATCCACAC 



NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNTAC TCTACTNNNN NNNNNNNNNN 
CAAATAAATA GTAACACTTT TTAATGTGTT 
TAATCTAAAT TCCCCTCTAG CACAATGAAA 
TTTCAAGTAA ACTGAAAACA ACAATAGTGC 
TGAAAAGGGA TAGTGCTTAA TTATGACAAA 
A CAT AT AC AG ATGCAGCCAC TTATTTTGGC 
AAATTTAAAT AGCGAGAGAT CAAATAGATA 
TGGATGTTCC CAGACACAAA AATAAAATGA 
GACAAC AT TA TAAAGACTAG ACACGTTATT 
AATTACCATT GAGTAATTGA CATTCGTATT 
GTTCTTAATT GCAATAATGT TAAATAAAAT 
CAACAAAAAT TATAGCAATA CTTCAACAGA 
TCAATGGTTC AGAATCCAGC ATCTACATAA 
AATTATTTTC TTTTCAAATA TTCTGGCAGA 
TTATAGATTG TGGTCTTAAC TGCAACTTTT 
TAAGTTATTT AAATGTTACT TAAATACTTT 
ACATTCTCTA CCCCTCTCAT CCTTTCCTGC 
CACTCTCCCC ACTTCCAAAT GCACTCAAAA 
TTTCTGACCA CCAGAAGAAG TAGTGGATTG 
CATTTTTCCN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
ATTCATGAGG GCATTTAATG TGTCTTATAG 
ATTATTCAGT TTTTACTGCA GCAGGAGTTT 
TGACACTATA GTACAGGGTA TTAGGAAAGC 
GAGATCTTGG ACAGTAGGAT TCTTCTAGGT 
CAATGGCTCA GGGCACTTTT CAGAGGACCA 
CCACCCAGGT CCTATTAATA TAGTAGTGAG 
TGACTTTCCC TTTTCCAAGG CCAATTACCA 
TCATATTTTC TTGTTCTGCT CAAATGAATT 
GAAGCTTGTT GGGTGTAAAA ACAGCTTTTC 
CTATTTGTTC TTCTGCCCTA CCCTTGCATG 
CTGTGAGCAT AACCTGGAGG ACAAGGTTTT 
CCATGTCTGC ATAATAACAT GATGTAGATT 
TTGGAATTCT AAAGATAATC TTATTTCTAT 
TACTAACTAC CAT T GAG ATT GGATTGGCTT 
GCCTCCATCT GCTTGGGTAT CTCAACATAT 
CACTTTACTC CCAGCATGAT TACCTTACAT 
TGTGGGAGTT TACTTTTCTC ATAACTTATT 
TAGAGTTAGA AACTATTGTC CAACTCTCAG 
AGATGGATAN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNGGAT 
TTAGATTGCA AT CAAACAGA AGAGGCTTGG 
ATGTTTGTTT TGCCAGTTGA AACTTGTTTC 
ATTTAGAGGT TATTTTGAGG GCACTTTGAA 
AGAGAGCAAA TTTTGAGAGT GATTGCCCAT 
TGATTTTGGG GTAAAATGTC TATCATTCGT 
CTGGACTTTA CTTAAGCTTT ATTAAGTTGC 
GTGATAGCAG ACCATCAAAT ACGGCCCAAA 
GAAGGTCAGA GAAGCAATGC TGCCTTTTCT 



FIGURE 3J 



30001 GATAGCCAGT AGCACCAGCA 
30051 AATATTTTTG AAGACCACTG 
30101 TTGATAAGGA AGGGATGACC 
30151 AAAGGTTGGT TTTTGCTGAC 
30201 TTATTGGAAG GTGAGCCAGT 
30251 AGTTTTGAAA TGGTTATGCA 
30301 TAAAGTTTTT TTTTTACATT 
3 0351 ATAAATTTGA TGAGTTTTAT 
30401 TATCCTAAAT TTGTATCAAA 
30451 CAGACTAAGT CCAGATTCCA 
30501 GTATTTAGCT ACAAGTAACG 
3 0551 GGATGGTGAT TTAGGGTTAA 
30601 ATTCTCAAGA TCGCCTCATG 
30651 GACATGCACG TTCTAGGCGA 
30701 CCTTCTTACA CAGCATTCCA 
30751 CCATTTAGCC AGAACTTAGT 
30801 TAGGGAACAT AGTTTTCCAT 
30851 NNNNNNNNNN NNCCATTCTA 
30901 AAGAATCCAG CTGGCCTTTT 
30951 ATCTGCTGTT TTAAGCACTT 
31001 CTCACTAAGA AGCAGTCCCC 
31051 TATATTCTGA AAGTCTTTTG 
31101 CACTGCCTCT AACACACCAA 
31151 TGGTTCTTTC ATCTCTCTAT 
31201 TTTTAGTTCC ACAGCTAAAT 
31251 TGCCACACTT GCCTACTATA 
31301 CGTATTTATT GAATTCCATT 
31351 GCTATATAGG AGAATATCTT 
314 01 TTTCTAAGAA GAAAGAAATT 
314 51 CATTTAATAG CCATTGGCTA 
31501 AGCCTATCAG ATGGAAGTGC 
31551 CTAAACATGG AAGAAACATT 
31601 TTGATGTGCT CAAGATTAGG 
31651 GGGAGAGGAA ACAATAAAAA 

317 01 ATAAATTGAT GGTTCTCAAG 
31751 GGGGCCTCAA GTTTAATTGT 

318 01 GTTTTTAAGT GGTATTTTTA 
31851 TCCAAATTCC AATTATTTTA 
31901 ATTTTGAGAT GTTTACATTT 
31951 GATACATGGT TATCCAGGTT 
32001 AAGGTCATCT TTTTAGATTA 
32051 GTGATAGGGC CTAGAAATTA 
32101 TAAACTGCAT ATAAACCATC 
32151 ATTAGAAGGC CCTCCTGGAC 
32201 ATTGTTTCCC TTTGAATTTA 
32 251 TTTCCAAGGG CATTTCCCTA 
32301 GTAGTCTTTT ACATTAAATA 
32351 GGTGGTACAG GGTCCAGTTG 
32 401 GTAAAATTAG CAAAGAGGCC 
324 51 TGACAGTTGG GGGAGGAAAG 
32501 TCGCAGGAGT ATTAGAGTCT 
32551 GAGGAAACTG G GAG CAT AG A 
3 2 601 TAAAATGGGT AGGAACAGGT 
32 651 NNNNNNNNNN NNNNNNNNNN 

327 01 NNNNNNNNNN NNNNNNNNNN 
32751 NNNNNNNNNN NNNNNNNNNN 
32801 NNNNNNNNNN NNNNNNNNNN 

328 51 NNNNNNNNNN NNNNNNNNNN 
32 901 NNNNNNNNNN NNNNNNNNNN 
32 951 NNNNNNNNNN NTGATTTTTA 



GGCACGTTGT TTACTCCAAA CAAGAATTTT 
AAAATGGATC ATTTATACTT TTTTATTTTT 
TACTATTCAC AGAGTAATGC AGTTTGCTGA 
CTCTGAGAGC TCACATTACA GTGGAGTGTG 
TTATACAGAG GTTGGGAATT ATTTTTCTCT 
CTTTCTAATC TAGTTATTTA GAT G TAG AAA 
CCCCTGAGTA TATGGTATTG TAGGTATAAA 
TTCTGTACTA GGATCTACTC ATAACTCTTC 
AGGAAACCTT TGCTGTCTTG ATAATAAACA 
CAGTCTAGTG ATCAAGGAAA TTCAGGAATT 
TGACACAAAG AACAGTGCCC TAAATTGCTA 
TATTATGACT TCTTGCTCTA CCCTCTTTCC 
GTCATAAAGA GGCCACTGTG GTTGAGTCAT 
GAACACAGAA CAAGCCGTGC TCTTCAGCCC 
AAGCCCCACC CCATCACTTC TGCTTTCATC 
CATTTGGCTG CCCATATCTG CTAAGGAGAA 
TACTAACCCA TCCCCCACTC NNNNNNNNNN 
TATTAAAGCA GAAGGGAAAG AGATATTGGT 
GTGATCTGTG TCAGCCTTTC TTTTGATCTC 
TACACTGTAG CCCCACGAGA ACACTTTGCA 
TTTGCTGCGC CCCCCCCCCA CCACTTTGCT 
TTTCCTATTC CACTGCTCTT ACCTCTAACA 
CCTGCAGTTG TAGTTATTAC ACACCCTCCT 
CACAGCCCTT GTTGTGGTTT AGCCAATATA 
TTTCATACCC TCTATGACTC TCTAATCCCC 
ATACATTATA TATATAACAA ATGTTTGATA 
CCAGAACTAA TGCCAGCAAG ATAACTTTGT 
TTTGTGCAAC AGTTTCCAAA GGGTTTTCTT 
GATTGTATCA ACTTTATGAG TATCCTACCG 
ATCTAAGGGT TCCTGGTTAC TTCACTGAAT 
AAACAACAGT TTGTTTTGAA ATAGGACTCC 
AACAGTGTTG GCCTGTTGGA ATGTGTGCAT 
GCACTCTGCT TGAGAACAAA TAACAAAAAA 
CTTTGGTCCT ATAAAGCACC TGAAAGTACT 
CTGGTCAGGG GGTCCAAAGG CTACAGCCTG 
TTTATAAAGT GTCCTAAATA AAATTTTTAT 
AAACTACTTT TTCATGCTTT GAGAGAGTTT 
AGGGGTTATT TCCTGGACTT GCACTTAACG 
TTTTCAATAT GGCATTCTGT GTGCCTCAGT 
G CAT G CAT AT ATAAATGTTA AGATTTATGG 
AAAAGAATTT TTTTAAGCTG GTATTTCTTG 
TGTAGAGTGG CTTACTTCTG GAACTTATTT 
CGCCTAGTGT ACAGTTGGCT AAAGAGTAAT 
AGTTTATTTC ATTTCATGGA TATGAACACA 
ATGCCATGTT TAAAATCAGA TTTTAAGAAT 
TCATTTACAC TCTGCTTGTG TTTTCTTTCT 
CCTCCTACAG AGCACTGCCT AAGGATTTGT 
GGATGACAAA CAGGCAAGGA AGGCCTGGAA 
CTGTGGAATG GAAGGTGAGG GAAGGGTTAG 
GTAGAAAAAA AAATAACATG CACATCAGTT 
TAAAGGAAAC AATGTTTGAT AATTATCAGA 
GCATATCCCA GAATGGAGAA CAGCATGGGT 
GTCAGGAGCT TCAAANNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TCAGGCAGTT ATGATGATAA ATGTATGGAA 



FIGURE 3K 



3 3001 ACTTCCCATT TCCTAGAGCT AAAGTGCATG TTTCTCATTC TGAAAT GT AG 
33051 GGAACATAAT CATCTGATAC CACTCACCTG ATTGTTTCTC ACTCTTCCTC 
33101 CACCATTTAC CCATCTCTTT AGCTTAATGA GTCCCCTGTG TATCTCCCAA 
33151 CTAAACAGCG GCTTACTTGC CTGTGAAATA TTCTTCTCTT GGGTAGTCTG 
33201 CTCCCTTCTC TGTCTACTCA TGCTTCAAGA TTCAACATAA GCCTCCTCTA 
33251 TGAGGCTTTC TGCACGTATG TATATGGATT TGCTTGTGTA ATGATTTCTT 
33301 CACAGATTTC ATATTGCTGA TAAATAAATA TTGTTTTGAA TAAGAAACGT 
33351 GGTTTTGTAT TTTTATCTCG ATTGTAGACT CCTTGAGACC AGTACCATGC 
334 01 TATACAATTA TTTTTCATCT ATTATAGTGT CTGGCATAGG GACATGCACA 

334 51 TATTTGGTAC AGAANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

335 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
33551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
33601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
33 651 NNNNNNNNNN NNNNNNNNNN NNNNNTAGAG AATAAACGGA ATAAATTCCA 
33 7 01 ATTAAACAGT GAAAATATTT GAGTGATGAT TTACAGAATT TTAAAGTCTT 
33751 GAGAAAGTGG AACTCAGTTG ATGGAAAAGA GTGAATGTCA AACTGAGAAC 
33801 GTCCTATTTG CTATGTTAGG GCATAGAAAG GCCATTATAG ATTGAAAAGC 
33851 AGTTGTAATG AATCTTAGGG TTAAGAAAGA TCAAGATACA GAAATATCCT 

33 901 TGAATTGAAG TTCCAAAACA ATGTTGTTTT GGTTTTTGTT TCATTTTGAA 
33951 TCCTTTCATA CTTAGGAATA CCATTTCTAG TAAAATAAAT ATTTTATGTT 

34 001 TAGTTAGAAA TTTATCTGTA TTTCATACAT ACTTAGTACT TTTGGACTAA 
34 051 GAACTGCTAT TTGAAGTATA TTTGAGGGAA TGAGTTTGAA ATTTTGGGTG 
34101 CAGGACATTA TAAAGTTGTA ACTATGAATA AATTTCAGTT ATGCTTATGC 
3 4151 ATAGTTTTAC CTAGTTTTAT TTGTCTATTT GAGTATTGTC CTTGAATTTA 
34 201 AAATTTTTTT CAGCCCCAAC TGATACACAC ACATATACAT ACATAATACA 
34 251 TGTGTGTGTG TGTAGCTTAC AGAGTGTTTA TAGGAAACTG ATTTTGTATA 
34 3 01 CTTTGGCTAC TTTGTTGTAA GTTCTAGTTT TTTTTCTTTT ATTATTAAAC 
34 351 TAGTGCACGA CATCAATGCT ATATGATTGG TGTTTCGTTG ACCTAGAAAT 
34 4 01 AATGCATGCC ATCTTCTTTT CACAGCTGTG TGCCAACCAC GATGCAAACA 
34 4 51 TGGTGAATGT ATCGGGCCAA ACAAGTGCAA GTGTCATCCT GGTTATGCTG 
34 5 01 GAAAAACCTG TAATCAAGGT AGGAAAACAG TCTGACATAA ATACACAATC 
34 551 GAAGACACCT CTATCACTCC CAAATTAAAA ATATTCTTAT CTCAAACTAC 
34 601 TTTCCATGGC TATTTTTCCA AAATATGTGA GCTGCTATTT TGCTGNNNNN 
34 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNTTTTA TCAAAGGACA 
34 7 01 ATTAAACAAA TTGTATCCTC TTATTCTCTG ATACTAATTA AAATGTATTT 
34751 TGAAGAAAAG AATCCCTGCA TCAGTAATTT AGAAGCCTTC TGGTACTCCT 
34 801 TGTTTTTCCA CTAACTAGTT GGAAATCCTT GGCCAAATAA TTAACCACCT 
34 851 CAGACCCCAG GTACTGCTTC CCTTTAATGC CAAAGTCAAG TAGGGGATTT 
34 901 GATTTGAAAT TTGGAGTTTC CTCCTAACTC TGAGCCCTTC GATTCGTGAT 

34 951 TAAATCTCCC TTCAACTACT GACCAGTTTG GAATGTTTCC ATGATAAATA 
35001 AAAAT GAT T A ATTTAGCAAG CACTTTTTAA AAAAATCAGC ATCAGTTGTT 
35051 TAAAGCAAAT ATTTATTCAA CACTCATAGT GCCCCAGGTA TTTTATATAT 
35101 ATAATCCATT TTTATCTTTA CAACAACATT AGGAAGTAGA GCATTATTTA 
35151 TTATAATNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35301 NNNNNNNNNN NNNNNNTTTT GGATAAAAAA ATATATTTGG AAATGTGATC 
35351 CTCGAACTCA TGCTACAAAG TCAGACAAGG CTTGTCTTGT TAATTAAATT 
354 01 CAGTTAAAAA TTTCCATTAT TNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
354 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
35701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NCATTATTAG GAATTAAACT 
35751 AAAGAAAGAT TAGGGTTAGA TTTCACTAAA TAGATAAGTG TCAAAAATAA 
35801 AAAAGATAAG TCAATTTTTA CTTATTTTTT AAATTATACT CCCTATCATC 
35851 TTAAATGTCA GGTGAAATAA TCATGGTGTC TAACTATCCC TTACATACTT 

35 901 ACTTGACCTC ATCTGATATA GAAATGATAT TGCTGAAATA CTACTGTTCT 
35 951 TCAGTGTCTG ATACTTATTC CAAGATACTC CTTGAGGTAT GTCATGTAAA 
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CTAATGATTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGAAAACATA 
ATGAACCATC 
TAGAAGCTGC 
AGAACTGATT 
GAAATTTACC 
ATTATCTCAG 
CATAAGTTTG 
TTAGAATCTC 
GTTTCATTAA 
TATAAAATAT 
GAAATTCATT 
TAAAGACATG 
TCCTATTTTC 
ATACCTATTT 
TCCCCAAAAC 
TGGGATTTTA 
AGTCCAGTTT 
AATCATCTCT 
TGATGGCAAG 
NNNNNNNNNN 
NNNNNNNNNN 
TAGTCCTGAA 
AGCATCTTAG 
CAGGGAAGCT 
CTTCCATAAA 
TTAACTTAAA 
ATATGCTTCA 
GAGAAATGTC 
TTAGCTTTTG 
AATTTTGTAG 
TCATGGTGTA 
GCTGCCTGGG 
GTTTAATTTA 
CTTGATCTTC 
TTAGGAAATT 
G C AC AC G T T A 
GATTCTTTGA 
TCCCTCCTCC 
CAGNNNNNNN 
GGGGAATGAT 
AAAGACATTC 
AAT ATT TATA 
ATGTTGAATT 
CCAGATACTA 
TAAGCACAAA 
CAGCCGTAGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAACTCCCA 



ATAGACACAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNGACACC 
CTCAAAATGG 
AGATGAAATC 
TATTCTAAAG 
TTCCCAATGC 
AAACATACCT 
TTTGCTCCTT 
CAGGTAGCAG 
AGTATACTTT 
TATATCAGCA 
AAAGAAGACG 
TAAATTTCTT 
CTAAAAGTAT 
AGAAAACCAG 
CTCTAAAGAA 
CATATTCCTT 
CACCTGCAAA 
TGAAAATAAA 
TATTTTGCCC 
GTTNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CACTGTGGAA 
AGAGTCTCTG 
GCCATGTGTG 
AGTTGTTAGA 
CTGTTTTCAT 
CAGATAAGGG 
AAAAAGAATT 
ATTAAAAGTA 
CGAGTTATAT 
AACAGCTTGA 
AAACAGCCAG 
GTAGCCTTGC 
CCCACTAAGT 
ACAGGCTTGG 
CCTTTCAAAG 
TCATTTGCTT 
TTGAGTTTCT 
NNNNNNNNNN 
TTTTAGTATG 
TTAGGAATTT 
AATGTCAATT 
CTTTATCATA 
ATTTTGTGGA 
AATCTAAGGG 
TTTTGAAAGA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAACCAATGT 



TTTTTTTTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTTTTTACAA 
TGGTTCATGA 
CACACTAGCA 
TGTGTTCTTG 
TTGTTTTCTC 
GCATAATCTT 
AAAAAACTCC 
TGTTCTGGTT 
TTTTAAAGTG 
CGGTAAAATA 
ATTAAAATAA 
TGCAGAAGCT 
T ATT GAAACA 
TTTAACATGT 
AAAATCGAAT 
AAAGAGAAGA 
AATAGTGCTG 
ACTACAGCTT 
TCCTCACAAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GCTTGAAAGG 
CATCCATAGA 
GCCTGGTTGA 
GAAAATTGTT 
TCAGGCCCAC 
AAATCAAACA 
AAATTTGAAA 
GCTTTTGGTA 
ACTTTAAGAA 
ATAAACAAGT 
AATACCAGGG 
CACAGGCTTA 
CATTTTCTGG 
GCTGATCATT 
CAGATGAAAT 
CCTTTACTTT 
GTACCAACAC 
NNNNNNNNNN 
TAAATATATC 
ACAGTGTACT 
CCTAATGTTT 
AGGAAAGAAT 
CTAGTTACAT 
CTTTAACTCT 
TGTATGGTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCATTTGATT 



ACTATTTATN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCATCCAGAA 
ACCATCAGAT 
AAAATGACAT 
TATTATCTGC 
TCTAATACTC 
TTTAAATAAG 
ACTTGATTTT 
ATTGGATGCC 
GTGAACATAA 
TTGTTGTCCA 
GTCACTTTTG 
GCATCAACTT 
AAAGTTCATC 
ACTTTTTTTT 
GTATCAAGTT 
GGATTTGATT 
ACACTGAAAA 
AAAATAAACT 
AAAGCAGTGT 
NNNNNNNNNN 
NNNNNNNNNN 
NAGTTTCATC 
GTTCCACCAG 
CTTGAGGGCC 
GTGTGAGAGC 
GATAGTGCCT 
ACATTTTAAT 
GGCTCCTTTT 
TAACTTTACA 
TATGACAGGT 
ATAACCCCCA 
GCTTAACCAG 
CAAGCTGCAT 
GTGTGATCTG 
ATTTGTTACA 
AACATACTGT 
TCTAACCTGA 
TACTTTTATT 
AAACCTCTTT 
NNNNNNNNNN 
ACT GC AT ATT 
TTATAATAAT 
TAGCATGGTT 
TGGAGTCTTT 
CTGAAAGTTG 
AATATTAAGG 
CNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNGAAG 
TGGACATTTA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CATTGATTTT 
GAAATTATTC 
GTCCCAATGG 
TATTTGATTT 
TTTTTAACTT 
TGATGCTCTT 
TCTTCCCCAG 
AAT GTT CATC 
GCCCTCAGAG 
AATGTGAGGA 
AGATTGCAGA 
TAACATGCTT 
ACTGGGTAGC 
CCTGTAACAC 
AAGAT CTTGC 
AGTAGCAAAA 
TGTAATTTCA 
TAGGTGTTTT 
GGCAGGTTCC 
NNNNNNNNNN 
NNNNNNNNNN 
TAATGGTTCT 
TTCTTGGTAC 
CTGTCAGGAT 
TGCCTAGAGA 
ACATAGCAGA 
AAAGTAGAAA 
TTCTGGAGGA 
GAACTGGAAA 
ATTCACTGAG 
GAAACTTGCA 
TGCCTTTAGA 
TTTGGAACTG 
CTTTTGGTGG 
CCTAGAACTG 
ACTCTACAAG 
ATTCTGGCAA 
TATGCATATT 
TTCCCCCAGC 
NTTTTCCATT 
CTCAGAAATG 
TTCAGAAGAA 
TATGTTTCAT 
TAGGTCAGAA 
ACTGCTTTGC 
TGGTTACCTA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTGTATGCTT 
ACTGGCTATT 
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AATAACATAA 
TTCTTTGAGT 
GTTAACATTT 
TTCTTACATT 
TAAAATCTTT 
AACTACTATT 
CGTTTACAAT 
TTGTATAATT 
CTTTTAAAAA 
CGATGGATTG 
ACAGTGATAC 
CAGAAACTAT 
AACTGTATGT 
ATTGACATCA 
ACGAGCACAT 
CAACCCCTGG 
ATTGTCCCAG 
GTCACTTCTT 
AGCCCTGAGC 
TGTGCTCTGA 
CATTGCTTTT 
ATAGCAAAAC 
AACTGAAATT 
AGTCCTCTTC 
AAAGCATACA 
AGCCTAAGTT 
CAAGTCATTA 
GCCTGTGGGG 
GTCATGCTGG 
GTTCCTCTTT 
TTAGAATAAC 
GAGAAGAAAT 
AAAGAATTAA 
TGACATATTG 
AGCGGGGAGA 
AAAATCCAGT 
TCTGCATGCT 
GCTGTGCAGC 
TCTCCCATCC 
CAATAGTAGA 
AATGTAAGTG 
ATGAAATAAT 
ATATATATTA 
AAAAAC AC C A 
ATTTGAGACA 
GGAGTATTTA 
ATTGAATGAT 
GCAGCACATA 
GGTAACATAA 
GCTTCTCACC 
TAGGAAAAGT 
AATCCTATTT 
TTTTTAGGAG 
AGTGATATTA 
CAGCTGGTAG 
ATATGCCCGT 
ACCCATTGAA 
GATTTTTTTT 
TTCATGATTC 
ATTTCGGTCG 



TGGTGTTGAA 
CTCACCCATC 
TTTTGCATAA 
TAAATGATAA 
ATGTAAATCA 
GTACAACGTG 
TCAGTGTCCT 
TGTTTTTTGT 
TGATGATCTC 
T T CAT T TATA 
ATTTTTAAGA 
TGTTTATAAA 
AATTGTTTAA 
GTTCAAAGTT 
CCCAGCTCCT 
ATCACCAAGC 
GTGGTCTGCT 
GCAAATGTTC 
ATGAGCAAGC 
AAAGCATGGA 
CTTTCTCTTT 
AACTTTTTTC 
GGGGGTTTGA 
CCCACTCTTG 
TTAGTGACCT 
CTGGTTCCAA 
ACCAATCAGT 
TTGGCGTGAG 
AAAGCTTTGT 
TAGGTCATTT 
TGAATTAAAA 
TTGATGCATT 
TTTCGGTCCN 
ACATCTGTTT 
AAGTCCAAAT 
AAGTTAGTTA 
AATAAAATTT 
TTGCATTTCC 
AAGCTCCTGA 
TGCACTTTAA 
GTTTATTGTA 
TCATTAAATT 
TAAACCTCCA 
AAGTTTTATT 
GTATGATCAA 
CTATAATTTG 
CTCCAATAGA 
TTAACTTACT 
TTGAAGACAA 
AGGACACCAG 
CTGATATTTG 
TATAGTATCT 
ACACCTGCCT 
ACACCAGTCA 
AGTACAGCTT 
TAATAACAGG 
TGTTGCATGT 
TGGACTCATG 
TTAACCCCAA 
AAGCCATTCT 



ACAGAACGTG 
TTATATTTTG 
AGCTTCTCAT 
TATTGACGAT 
TACATTTGTG 
ATAGTGTAGG 
TAAATAGAGC 
TTTCTTCCTT 
TTGTTTGGAA 
ATGTGTGTGT 
AAATTAAAGG 
GTATCTAATG 
TATATACTTG 
GACTCTTATT 
CTTGACCAAG 
CACAAGTTTG 
CTTCCCACCT 
TGAAGACTAG 
AGCAGGAATT 
GCCCTACTGT 
TCTGAATTTT 
ATATGTAGGC 
TGCCTTTTGC 
AGGTATAGAG 
TCTTTTGGTA 
GAGAGCACTT 
AACAGGGAGA 
AATAGAAACA 
AGGAAAATAA 
TATCTCTAAC 
TAGTTGAAAT 
TGTATATTTT 
NNNNNNNNNN 
GCCTATGACA 
GTGTTAAAAC 
TCTCCCGATT 
CTGATTTTTT 
ATTCTGGACC 
ATCCACCAAC 
GACTTGTATA 
AAGTTCTATC 
TTTTATTTTC 
GAGAAAAAGG 
TCTAGGTTCT 
TGACTTCATT 
GAAGTAATTT 
TTGTGATAAT 
TAATATGGCA 
GAAACTCTTA 
ATGCATTTAA 
GCAAATTTTG 
GAAGTGGCTT 
GTGGCAGATG 
GCTGTCAGGG 
TCTCAATTGC 
TAAAATAGCC 
TTTCTTCCTT 
ACAGCACATT 
AGTGCTTTTG 
CTAAAACCAC 



ATTGTTAAGT 
GTAAGCTAAT 
GTATGTTCCA 
CTATTATGCT 
CATAATTTTG 
GAAAAAAATC 
AGGTGGATTA 
CTGTGTATGA 
TTTTTTAAAA 
TGTCCTTGGG 
AAAAGAGAAA 
CATTCTTTTT 
CAAAGATTTT 
ACCTTCTTCC 
GCAGTGAACA 
CCTTCAAGGG 
GCATGGCTTC 
CCCAGGAGTG 
GCCACCTGGT 
CCTTTTAATG 
GAAATATGTT 
AAATTCAGGA 
CAAGGAGATT 
TTTTAGGTTA 
TACATTGTGT 
AGTTGCTATT 
CAGTAATTCC 
CCAAAAATAT 
TGTGGTAGCA 
C CAT T CAT AG 
TATTAATCAT 
CTACACAAGG 
NNNNNNNNNN 
TTGCCCGATT 
AAATGAAAGT 
TTTCAAGCTA 
TCCTGAAGTT 
AGTTCCCTTC 
CTACTGAAAT 
CATAATAACT 
CTTTAGTGTA 
CTATTTTATA 
AAGATAAGTA 
TTTATCAACT 
TTGTTCTGCT 
ATTTTTGAAT 
G AAC AC AG C A 
CTAGGTCAAT 
AGAAAACTGA 
TCTTTTGAAG 
ATAAACATGG 
TCATAAGGGT 
GGACAATGAT 
AATATCATCC 
TTTCCATGTT 
AG T AC AT CAT 
TCACATATTC 
GGCTTTTCTT 
CCATGGGAAC 
CAGCAGCTCC 



CTCAACTCTC 
AATGGCAACA 
GAAGCATAAT 
GTTATTTAAA 
GAGTATTTTA 
TTAAGTTGGT 
TAATCAATAT 
CTAAATAATA 
AATTGTATGA 
AAACAAGATG 
CCAGAAGAAC 
AAACTCTACC 
TAGGGCTAAA 
TGGTGTGAAG 
GCCTCTTTTC 
GTGAGCGTGC 
CTGCAGTTAT 
TCCAGCCTTC 
GCTGATTCTC 
CATCGAGTAT 
CTAAAAACAG 
GTTCCAAAGC 
TTAGAAATCA 
TCTGAAAATC 
AACATCAGAT 
TGATTATAAG 
TGCCCTATTT 
TAATGTCTTC 
TAATTTTACA 
GTTTAAGAAC 
TGTCTCTGCT 
AATTAGGGCA 
NNNNNGTAGT 
TAAGCACCAA 
TTATTTTTAA 
CTTTTCAGTG 
TAATAATAAT 
TCCCTCTCCT 
GTATTCCTGA 
GAAGCATTTG 
AGATAAGCAC 
GATTTTCCTT 
AATTTAAAAT 
TTTAAGATTT 
TATTATTGTA 
TTATTGCTTA 
TTTATAGAAA 
GAGAAAAGAA 
GGACAAAAAG 
CTCTGTACTT 
ATGACTATGG 
CATTGTGAAG 
GGCAGTCACT 
AGACCATCAG 
TTGGATACTT 
TTCCACATTT 
ATACAGTCCA 
TCCTTTCAGT 
GGAAGGATAA 
AGTCCAGATT 



FIGURE 3N 



42001 
42051 
42101 
42151 
42201 
42251 
42301 
42351 
42401 
42451 
42501 
42551 
42601 
42651 
42701 
42751 
42801 
42851 
42901 
42951 
43001 
43051 
43101 
43151 
43201 
43251 
43301 
43351 
43401 
43451 
43501 
43551 
43601 
43651 
43701 
43751 
43801 
43851 
43901 
43951 
44001 
44051 
44101 
44151 
44201 
44251 
44301 
44351 
44401 
44451 
44501 
44551 
44601 
44651 
44701 
44751 
44801 
44851 
44901 
44 951 



ATGAATACTA 
AGCCAAATTG 
GACATAGTGT 
AGCTGCCTGA 
AAGTTTCTGT 
TAAAGAAAAA 
ACTGTGTTAT 
CTTAAAACAC 
TTGCTATAAT 
CCATATTAGG 
CTCTCATTTT 
GTAGAAAATT 
GGGAGAAAAC 
AATTTTTTTT 
TAGTCAAGAG 
AAAGTATTTG 
GTCTGCTTAC 
AGTAAGATAT 
AACATAATGG 
TAAATGAGCA 
CACTAGACAT 
AATTAAATTA 
GTGGTCACAT 
AATGTTTATT 
TTTAGGTTAA 
TGATACTTAA 
TCCATTAGTG 
TGTTGGTAGG 
GTAGGAAACT 
GATTGAACAA 
TTATCCTCTG 
CAAAGCAGTC 
TTCTCAAAGT 
AAATATTCTC 
CGAATAGGGT 
TTTGAGAATA 
GAGGACGTTA 
TGTTGAGAAT 
ATGGCGACTC 
CATGAACAGG 
TATTTTTCTC 
ATTAGTGCTG 
GATCTAAATG 
GAACACTTAC 
TGCCGGATGG 
GTGCTTTGGG 
TGGCCTCCGG 
GCCTGAGGAA 
ACTCTTCTAT 
GAATAAGAAT 
GTTTACCCAT 
AAAAAAAAAA 
CCATCAGGTC 
TTTTATATTT 
TAATAATAGT 
AATATGATAC 
TTTTCACCAT 
ATAGGCTGAC 
ACTTGATTTT 
AAAGTTAGCT 



CCATAGCAAT 
TAGAAGAGCT 
CTGAAGAAAT 
AACTGTAAAC 
TATGCTTTGA 
TCCAT CAGTA 
GTTGTGGTTT 
AAAGCCTTTC 
ATCAAGAAGA 
AACAGTCATT 
TAGTAAATTC 
TCTCAGTCAT 
ATTTTTGATA 
GAAATATTTA 
ACAATAAAGT 
GTTTATTTAT 
AGTTGCATTC 
GAAAATTTTT 
AACTTGATGA 
GATCAGATTG 
ATTATCTATA 
TGCTAAATGG 
TTCTTTTTAA 
TGTCATTATG 
AACCTTTTCA 
ACCAATAAAC 
TTTCCATATG 
TTTTTTAAAG 
GAAGTTTCTT 
GAAAAACTAC 
AAAAT AT AG C 
AAATGTTTTA 
CCAAGCCTTA 
ATGTAATATG 
TACATTGTCA 
TTGGCAAACA 
GTATGCTGCT 
TTTCCTAACT 
CTGGTTGCCC 
ATCTATCTCT 
TTTCATTTTT 
AGGTTTTGTA 
AGTGTGGCCT 
GGCAGCTACA 
TTCCTGCTCA 
CTTGTTTCTG 
GGTTCTCCTA 
TTGAAAATCA 
CACGGGTGCC 
TACAAAACTA 
GTCTTCTATC 
AAGAACACTA 
TGTTTTGTAA 
CATTATAAAA 
CTAAGACTTC 
CAGAAAAGTA 
CATTTGGATG 
TCCTGAATTG 
TATTTTTACT 
AATTTTATTT 



CCTAAGGGGC 
GCCCCAACTG 
TTCCATCAGC 
ACCGAATCCA 
AAATAAACCT 
TCCTTTTCTC 
TGTTCTGTTT 
TCTAACACCA 
TTTATCAAAG 
AACTCCTCTT 
TGAAATACAA 
TCTTCCTTGA 
CTGTAATCTG 
AAATGTGATT 
AG AAAAT T GA 
TTAACCTTTC 
AGGGCAGATA 
TTAGATGTAA 
TTTAGTTAAA 
AAATAGATTA 
TTTTTTATTC 
GTACTTATGC 
TGAAGCATAT 
AGTTTTATGT 
TAATGTAAAA 
CTGAAATACA 
CCTTTTTTAA 
TTCATTTTGT 
CTTTAATCTG 
AATATTGATC 
CTATGTAGCC 
AAATCTGTAT 
AGGAGGCCAA 
TTCTTTAAAC 
GGATTGGATC 
GTTGTTAGCT 
GTTTCTAGGG 
GGCCTTTATA 
CTAGTGTAAC 
TATGTAAATG 
TAATATAGAA 
ATTTTTCTGC 
GAAGCCCCGG 
AGTGCTACTG 
AGTATGTCAA 
TTGTGCTCTG 
AACAAGATGT 
AAATAAGAGG 
AATATTAAAT 
AGTATGTTTT 
TGGGCATAGA 
CTGACTATTT 
TTTTAAATCA 
GCCTTTAGGC 
TAAAAAACAA 
GGATTCATGT 
TAAAAAATGG 
TGGTATTCAT 
GAATTATCAT 
AACATTGCAT 



CCTTTATTGG 
GGCTTCAGGT 
ATAAATACTC 
TTCCCATCAG 
AGGGATACTT 
GAGATATAGG 
TTGTTTAGTG 
TATTTGGTTT 
ACAGGTGTGA 
AGAAGATGAG 
AGTAGAAGAA 
TTTTTGCTTC 
CATGGCGCTT 
TTGTTTTAAA 
AGAACGCATA 
CGGGATTTGC 
AAAGACCTAC 
TAATTATTTG 
TGAAATATAC 
GATTTGAAAA 
AATGACATGG 
TGGTGGTTCC 
ATTTTGGGCA 
TCACTTGAGG 
TTTGTTGATT 
TGACAGTCTA 
CTGCGGAATA 
GACTTGAATG 
AAATCATATA 
ACTCTTAATT 
TTCCTTTGGG 
ACTTCCTGTA 
TGCCTTATCC 
AGAGAAAGGA 
TAGCAATAGA 
GAT AG AG CAT 
TTGATAAGGT 
AACAGAGATC 
GTACCTTCAA 
ACAGTTACTG 
AAGAATTTTA 
TTCAACCCAC 
CCCTGTAAGC 
TCTCAACGGA 
GAATCTTAAC 
AGAGCTTGCT 
GTGCGTGTGT 
CACCATTTTC 
TGATTAGGAA 
CTAATTTTTT 
AGAATGATTC 
AGGGACTGCT 
TTCATAAATA 
T T T T AT AG AA 
TTTCAAATAA 
TTATAAGGAA 
ACTTTGCCCT 
ATGATATCTG 
TTAACTGAGT 
AAGTATAGCA 



CCCAGACTGG 
TTAGTTCAGG 
CTTGTTTATG 
GAACTCACAA 
AACTGACATG 
TTTTGATTTT 
CAGAAAAATA 
TAAATGCTAT 
CTCTGCAGGA 
AAAAGTTTCC 
TATGGGAAGG 
AATTAACAGA 
TCTGTGAGGA 
AACCCTTAAC 
GGAGATGACA 
CCAGCTCGGG 
TTGGAAAATC 
GTGGTTTTAA 
AGATTTCTAA 
CAATTTTTGT 
ATTAATAGAT 
AGACAAACAT 
TTACTATTTC 
CAGTGTTTAA 
CATTTCATAT 
TTACATTTTG 
AATGAAAATA 
CAT TACT AAA 
AAATCTACCA 
TATTTCGTTC 
ATGGGAAGGT 
AAATAAAATA 
ATTTAAAGTA 
AGACATTGAA 
ACCCAAATAT 
CTCTGCCCAT 
AGGCACCTGT 
CATAAAGGTC 
ATTAATATTG 
TAGCTTGACC 
GATGGCTTAC 
ATTGTTTTCA 
ACAGGTGCAT 
TATATGCTCA 
TGTTTTATAA 
TTTGTGAAAA 
TTATACTTTT 
GATGTATAAT 
ATGGTGTTTA 
TTTTAAACAT 
CAGGCTAAAA 
CAGACAGAAT 
TTTTCTTCAT 
TTTTAGACTC 
AAAATAAATG 
GAAATGCATG 
AAAATTCTCT 
ACAATTAATG 
TGCATAAAAT 
TGGAACTTTT 
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45001 TGGAATTAGG TGAATACATG TTTAACATTG TGCAACTCAA TGGGAAATCT 
4 5051 GCTGTTCCCC TAGAGAAATT TCATGGGCAT TTGAGACAGT TACTTGGATG 
4 5101 ATTAGTTAAG AATAGCGTTT AGCTTGAGTA ATCTGGAAAA ATACCTCAAT 
4 5151 TCTTCACTTT CCTTGGCCAC TGGAAAAATT TCCAGATAAT CTATATATGA 
4 5201 TAGTTTTATA TTTGTCTTTT AATGCTTTCT TTCCTTCCCC CTTTTATTTT 
4 5251 CTTTAGATGT TTGATAGTTA AGGCCTCTTT GTCTTCATTG TTGTTCAGTG 
4 5301 TCACTGATTT CAGGACAAGT GAAAAAACAT AGCTATTTCC ATTACAGATT 
4 5351 TTGTTTTCTA CCTGTCTGAG TCAGCCAGTC ACTCTCTTTC CAGGTGCCCT 
45401 GACCTGCTCC ATGGCAAACT GTCAGTATGG CTGTGATGTT GTTAAAGGAC 
45451 AAATACGGTG CCAGTGCCCA TCCCCTGGCC TGCAGCTGGC TCCTGATGGG 
4 5501 AGGACCTGTG TAGGTGAGTT GTAAAATCAA GCATCTCTGT CAGCAGCCTC 
45551 TGTAGGATAA AGGGAGAAAG TGAAAGGTGA TGGGAATAAG GAAAAAAAAG 
4 5601 GCAATTACTT ACATCAGATA AT TAG C TAT C GTTCAGAAGA TATCAGATGT 
4 5 651 CTCAGAAGAG GACATCTCTG TGAATGGATA ATGGGAGCGT TGTTTTTTAA 
4 5701 AAAATGAGAA ATANNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 5751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 5801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 5851 NNNNNNNNAT TGTATGTGTT GTTCTTCAGG CCTCAGATTT TTTTCTGGTA 
4 5901 AATTAGGAAG GCAGTGTAGG ATTGAGAGTC TGGANNNNNN NNNNNNNNNN 
4 5951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA AGTGTTTGGT GATTAGAGTN 
4 6151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 6301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNGTCTATT 
4 6351 AGCCTTAAGG TAAGACCTGT ATCATTAGCC ATGTGAAGAA ACAGCCAAGC 
4 6401 AGCTGAAGAC ATTGCAGGGC ACTGCGGTGA GAGAGAGGTT GGCCTGTGTG 
4 64 51 AGGTTTTTCA AGGAAGAAGG TCCATGTAGC TCAAGTACAG GGTGTGAGTA 
4 6501 TGAGGGGTGG TGGCTGGAGA TATAATGGGT GAAGTAGGTG AGGGTGATTT 
4 6551 GTTGCAGGGC CTTCAAGGAC ACAGGGGAGG AATGTGGATT TAATTCAAAG 
4 6601 TGCAAGTAGA AATAGTTAAA AGATTTTGAG CACAGTAATG ATCTGATTGT 
4 6651 ATGTTTTAAT GGAATCATTT TGACAGTTGG GTTGGATATG CTTTGGAGGG 
4 6701 GGTAAGTAAC AGCAAAAGGA ACTGTTTTGA GGCTTTTGTA ACAGTCTAAC 
4 6751 CCAAGAGATG GTGGCTTCAA ATAGGGTAGT GGTGGTGGAT TAAGAGAAAA 
4 6801 GTAGACAGAT TCAAATACTT TTTTTTAGAG GTAGAATTAA CTGGACTTGA 
4 6851 TGATATGGGA GTTAAAGGAA AGAGGGTTGT TTCTAATTCA AGAATGACTC 
4 6901 ATGTTTCTAG TTTAAGCAAC TTGGTAGATA CAGGTGCAGT ATGTGCTAAT 
4 6951 ATATAGAGTG CAAGAAAATA ATAACTTGGC TTAGAAGGCT GGGTTGAAAT 
4 7 001 TTGGATATGT GGAATTTATG TTGCCTGTAG GGACAACCAG CTAGATACCG 
47051 TTTATGATGA GCCACTATAT GCTAGGCAAC CATTTGTAGT CAGTTATTTA 
4 7101 TTTTGAAACC CACTCTGTCT TAACATACCC TTGCTCTTCC TAAAATGCTA 
4 7151 TTGACTTATG TTTCTAGATG TTGATGAATG TGCTACAGGA AGAGCCTCCT 
47201 GCCCTAGATT TAGGCAATGT GTCAACACTT TTGGGAGCTA CATCTGCAAG 
4 7251 TGTCATAAAG GCTTCGATCT CATGTATATT GGAGGCAAAT ATCAATGTCA 
47 301 TGGTAATGAA ACCCAACCAT TGCTTTGTGT TGTTTCTTCC TAGAGCACTG 
47351 AAAGGTCTCG TAATTGTGGT GATGGCTGGA ATGTCAGGGG CAGGGGAGAG 
4 74 01 TACTGGCGTT AAGTTAAACC AACAGACATC CAGTTTAACC ACTGGTAGTT 
474 51 CTCAGTCTAC ATGTAGTTTA TTTCTTCTGT TTATCTGCCA ATTTTATGTA 
47 501 GATCATCACA TTGCCAAAAA AAATCATTTT TGAAACTGTA TATATTTTTT 
47 551 ATGTCATCAT ATTTATCTCC TAAATAAGTC TCTTCTTTTC CTACTTTCTG 
47 601 ATGCAGACAT AGACGAATGC TCACTTGGTC AGTATCAGTG CAGCAGCTTT 
4 7 651 GCTCGATGTT ATAACGTACG TGGGTCCTAC AAGTGCAAAT GTAAAGAAGG 
4 7701 ATACCAGGGT GATGGACTGA CTTGTGTGTG TGAGTAGCAC TTGTCTCTCA 
47 7 51 GCTTTAAATT CTAGCAGGAA ATACAGGATT ACACAAAGGC CATTGCTAGG 
4 7801 GAAAATAAGG AATAAGATTA TCAAAGAAGT ATAATTGTCA TAATTGGTTA 
47 851 TATTTGTCTT TGATTTCCAC AAACAATAAA ATCACTTGCT CAGGTACTTG 
4 7 901 TAAAAACTAA GGACTCAGTA ATACACTATA ATCTTAAGAG TATTTTAATC 
4 7 951 TCTTCACTGA AATCTCTCAA TATTTTCTTT TTAGCTAAAA AGAAATTATT 
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48001 GAGTCCTCCC TGAGAATTTG CTTTTGTTAA AATTAGAAAT CATGTTTGCC 
4 8 051 ATTAGCGTTA GATTTTGATG GTGGGATAAT CTGGATATAT TCTACATTTT 
48101 TTTCCCTCTG TTTTATGCCT CCTAACTCTG CTATTAAAAA ATCATTCCCN 
4 8151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 8201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 8251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 8301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 8351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 84 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 84 51 NNNNNNCCAT TCCCATATTT ATTTCCTCTA CATTCAGAAC TGTTTTGCTT 
4 8501 TTAAAATTTT TATAAGTTTT ATTTGGAGTA GTGGTTATGT AATAGATAAA 
48551 CATGAAAAAA ACCCAAAAGT CATAATTTAA CATACATTTC ATGTAACAGA 
4 8 601 AGATAATTTT TTAAATTTCC TCCGTAAAAT ATTTACTGAA AACCATGGTG 
48651 CATAAAATCA TGAGGATTAA AGGGTTTTTA AAGAGTCATG TAGTCATTCC 
4 8701 CAGTCTGGAA TGTAT CATTC AG GC AG ATT A AATATTGATT CTTTTTGATG 
4 8751 AAGATTATCA AGGAAAAAGA TTTCACAGAT GATCTTTGCA ACCTCTGAAA 
4 8801 ATGTCTATTT AAAAGCAGTT GTATCTATCA GTAAAGGGAA ATAAAAACCG 
4 88 51 ACCATAAAAC AATATGTAGA ACATTTACTC AAGCTGTATT TATGCAAATT 
4 8 901 GAATTTATAT AAAGTGTGGA TAAGAAAAGT ATTTGTCTTT AGACAATTCT 
4 8951 GAACACAATT TTATAATATA CACGTAGCAT AG G GAT AC AA CGTAGATTCT 
4 9001 AACTATGCAT TTGCTTATAT TAAAAAAACT TAGANNNNNN NNNNNNNNNN 
4 9051 NNNNTCTGTC AGTGTAATGC TTCACTTTTT AGGGAGGAAT GGTTACATAC 
4 9101 TGATTTAAGT CTTTGACTCT TTCAGTATAT CCCAAAAGTT ATGATTGAAC 
4 9151 CTTCAGGTCC AATTCATGTA CCAAAGGGAA ATGGTACCAT TTTAAAGGGT 
4 9201 GACACAGGAA ATAATAATTG GATTCCTGAT GTTGGAAGTA CTTGGTGGCC 
4 9251 TCCGAAGACA CCATATATTC CTCCTATCAT TACCAACAGG CCTACTTCTA 
4 9301 AGCCAACAAC AAGACCTACA CCAAAGCCAA CACCAATTCC TACTCCACCA 
4 9351 CCACCACCAC CCCTGCCAAC AGAGCTCAGA ACACCTCTAC CACCTACAAC 
4 9401 CCCAGAAAGG CCAACCACCG GACTGACAAC TATAGCACCA GCTGCCAGTA 
4 9451 CACCTCCAGG AGGGATTACA GTTGACAACA GGGTACAGAC AGACCCTCAG 
4 9501 AAACCCAGAG GAGATGTGTT CAGTAAGTCT AATAAATGTT AGCACATTTT 
4 9551 CAATAGGCTC TTTATAATGA CTTTTCAACC ACAGGCCATG CCTTGAATAA 
4 9601 GAATGAAACT CGTAAGAAGA ACTAGCTATG TAAAGT CGTA TGTCCCTATT 
4 9651 GACAAATATT ATAAAGAGCT ACATAAAGAG TCAGTCTAAT TGGGCAAGTA 
4 9701 AGAAAGAATG TATGTAGCAA TGGAAGGAAT ATTCACAAAG TCATATGGTA 
4 9751 GATGACAGCT CTTAGCCAGT ATGGGAATTC TGACATAGTT GGATTTACTT 
4 9801 GAAAACTCTC AGAGGTGGGA ACTTAACTGT ATTCCATGCT ATGTTACTTT 
4 9851 TAATCTAACC CTTCNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 9901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 9951 NNNNNNNNNN NNNNNNNNGG TGGGAAAGAT GGTTACTCAG AAG AT AC C AG 
50001 TGGTAGTATG TTAGTGAAAG AGATCTGTGT AGGACACTAT GAGAGGAGAA 
50051 TGAAGGGATA CCTCGTCAGT CTTTGGGTAG GGAATATTCA GGGAGGGCTT 
50101 CCCGAGTGAA ACGGAGCAAA GTGGATGAGG AAAGGAAGAT ATTATAGTTA 
50151 ACAGGGACAG CCCAACAAAT GCAAAGAAGT AGGAAGCAGT GTAATAGGCA 
50201 GTAGACTGCC AGTAAATTGG AGGGTCATGT CAGGGACAGA TGAGAGAAGC 
50251 TGGACAGCTA CCCCTTGGAT TGCTGTAAAC TGATAGTTAG ACCTACACTG 
50301 GGGCTGGATT AGATCAAAGA CTCCCAGATT CTTAGCAGGT GACTAGGTAA 
50351 AGGGTAGCAC CACCAATGGA GATAGAGGAT TCAGGAAGAG AAGCAAGTTC 
504 01 TGAAGGAGGA AAAGGTTGTT AATTTATTAC CAGACCAAAA TTTGCATGTT 
50451 ATAACTTTCA TCTGTTCCTC ATATTTATCT GCTTGTCCTG TGTCACTTCG 
50501 CTTATTCTTA ATTACCTTTG CATTCACTCT TCCTGTCTCC TCCTTTATAC 
50551 ATTACCAGTT TGTTTTGTTC CTTTTCTCAT TCCTCTATGA TGAAGCTAAA 
50 601 GTTTGTGCCT TTTATATTGA ATACCTTTAT AACTTTTTAT CAGGTGAAAA 
50651 AAATACTGAC ATT TAT TAT T CTTTAAAGGG GAAAANNNNN NNNNNNNNNN 
507 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
50751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
50801 NNNNNNNNNN NNNNNNNNTA ATAAAGGGGA TATCAGAGCC GCATTTTGCT 
50851 TGTATGTGAC TCTCAATTAA ATTTTCTCCT CTGGGTAATT GTCCATTTAA 
50901 AGAAAGAGGG AAATTCTCTA ATTCTGATAT AGTCATGTAT GTTTTGGATA 
50951 ATCAATATAA TTTTTAGGGG AATTAGATGG TAGCTCTCAA AATATACATT 
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51001 TTATATGTGC ATAACTATTT 
51051 TCACCCCCTC TCTCCATCTC 
51101 CTTTTTAAAT TCTCATCATT 
51151 AGTGCTTAAG TAATTATGGT 
51201 AG GG AAATTT GGCTATTTAA 
51251 TTATGATGAA AGAATACATC 
51301 TTTTAAAATG TTTCTCTTTA 
51351 ATATGAATTA AAAATGACAT 
514 01 AAATTGGGGA AATACAGAGT 
514 51 GAGAGTGTGA TCTTAAAGAC 
51501 TGCCAAGGCT GTTGCATGCC 
51551 TGCCAAAAAT TAAATTCCAC 
51601 TTAAGTCTAG CATGTGTGAG 
51651 TTGTGGTTAT AGCATATATT 
51701 TTCAGGACAA ATAACCAGTG 
517 51 TAGCACTATT TCAGTTCAAC 
51801 TCTGGCATAA CTTCTTAAAT 
51851 TAGAATTTAG CTCACCTCAT 
51901 GCCACTATTT GGCAGTCTCC 
51951 TTACATTGAA AGATATTTTT 
52001 NNNNNNNNNN NNNNNNNNNG 
52051 AAGAGAGCTT TTCCTTATAA 
t| 52101 TGTAAAGATA TTCTAAAAGA 
? i; J| 52151 ACACATCTTC AGCCCAGACC 
52201 CTGACTGCTT AACCTATCCA 
; /1 522 51 CATGTATAGA ATTGTTCCTT 
52301 CTTTCTTATC CCAGGTAATG 
W 52351 TAATGTGGAT TCATCCTTAA 
H 524 01 GGAAATCCTG TTAGCTGTAC 
f;n 524 51 TCACCAACTC CAGTGCTACC 
52501 TACAGGCCCA AAATCCTTTA 
^ 52551 TAGATTTTAG AAAATCAGTA 
p 52 601 GTAGTGAAAC ATGCTGATTT 
H 52 651 ATGTTAAGGA AAGGAAAATG 
Q 527 01 TATATTTACT AGCTTGCACA 
y ; 527 51 AATTTTTTAC TTATATTTTA 
52801 AAGTAGTTAT CTCTAGAAAG 
N 52851 AAACCAGAAA TCAGATTTTT 
Cj 52901 TTATAATTAA AAGTATTGTG 
y } 52951 TATTATATCT TCCTATGCCT 
53001 AGTACCATTG ACACTGCAGT 
53051 GGGTACAAAA CAGAAAAGTT 
53101 GTAGTTATAA TGAGCTTTGG 
53151 CTAAACCAAC ATTAATTATC 
53201 AAGTCCTCCC AATTATATCC 
53251 TAATAACTCA TCNNNNNNNN 
53301 NNNNNNNNNN NNNNNNNNNN 
53351 NNNNNNNNNN NNNNNNNNNN 
534 01 NNNNNNNNNN NNNNNNNNNN 
53451 NNNNNNNNNN NNNNNNNNNN 
53501 NNNNNNNNNN NNNNNNNNNN 
53551 NNNNNNNNNN NNNNNNNNNN 
53 601 NNNNNNNNNN NNNNNNNNNN 
53 651 NNNNNNNNNN NNNNNNNNNN 
537 01 NNNNNNNNNN NNNNNNNNNN 
537 51 NNNNNNNNNN NNNNNNNNNN 
53801 NNNNNNNNNN NNNNNNNNNN 
53 851 NNNNNNNNNN NNNNNNNNNN 
53901 NNNNNNNNNN NNNNNNNNNN 
53951 NNNNNNNNNN NNNNNNNNNN 



TCTG AAATTT ATGTTTTTGT AATTTTCTCC 
AGATACTGTC TTTTTCTCCC TATTACTCTC 
GTGATCATAC AATGGGATTT TTAATTTATG 
ATTTACATTA TTTTGCTGCC TTAGGATAAC 
TGTAAGATGA TACGCTAAAT ATTTTTTTCA 
TTTCTGAGAA TTTTAAAAAA TCCTTTCTCT 
TTTCCAGTAT TCCTCTCTAT GCCAATACAT 
TGAAGTTCAC CAATAAAATT TAGTGCAATA 
TCCAATGATG TTTGGGAGCA TTCATTATAA 
ATGCTCTGGG AAAGCATTGC CTTGATCAAA 
ACAGATAGCA TTGCTGCCTT TAGAAACTCC 
TCTCATTTCA TCTTCAGCGA TAGCTGCTTA 
AATGCTTTAG ATGCTTTTTG ACTTGCTGTC 
CTAAATAAGG CAAAGGCTCT AAGTTTTTAA 
TCTATTCACT GTGAAAACAG TAAGGAACCT 
ACCATTTCTT TACTGAATAT TTTCTTCACT 
CATCCTCATT CTCCGAGACA GGTCTGAGAA 
TTCATAAATT TTTAGGTGAT GTTAATTATG 
TTCAAATAGC CCTCCACTTT GTGTTTTTCT 
AAATTGANNN NNNNNNNNNN NNNNNNNNNN 
AAAGATTTCT TTTAATGGAA GAGGTTGTGA 
TGTATGTGCG TTGTTGCCCT GAGAAAAGAC 
AAATCAAGGA AGAAAAATAT TATAACAAGA 
TCTCCCCCAA ACTCTAGACC TGGATGTCAG 
GTTGGATGAA AATAGACGTC TCAAACTCAA 
TTCTTCCCTT AAACATCCTC TACTCTCAGC 
GGACCTCTAT CACCTGTTGC TAAGGTCTAA 
TTACTCTTCA TACATACAGT CAATATGTAA 
CTTCAAATTA TATTTAAGTG TGACCTTTTC 
ACCCTGGCCC AAGCCATCTC TCCCCTGGAA 
TCCAAATAAC TTATGGAATA TAGCATTTTT 
AGGTACACAT ACTATATAGT ACACACTGAA 
TCCTCTAGTG CTTTTACTGT GAACGTATCA 
ATATTAACTC AGAGATGATG TCTCACAGCA 
AATTTTTTAA TGTTAGCAAG ATTTAAGGCA 
ATTGGATCCT ATGATGATTA TTAAAGAAAA 
TATAAATTGA GCTCTTGGCA AATGTGAGCA 
TTAAAGTTAC GTGTACATTT GTCTATAGAG 
CTCATAGCCA GTCTGTGATC TGGGGCATAT 
TAAAACAATG TTTCCTGTAA GTATAAACAA 
TTTTGTCCAT TTGCTACTTC CAAAAGAAAG 
ATTTAAACTT TAAGCAGTTG GTTAGATATG 
TATTTTAATA TGGGGCTTAA AAATTTTCAT 
TAAGTGTGAT ATCCTTAGGA GGTCTTTGTT 
AGCATCTCCA AAGGTGACTG AAGTCCCTTC 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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54 001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54 051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54 201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54 251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54 301 NNNNNNNNNN NNNNNNNNNN NNTCATCCCA CCACTGGAGC ATTTTTCCTT 

54 351 ATATTTAACC TAAATTTATG AAACTACAGT TTATGCATTT TTAATATTGT 

54 4 01 CAGATACTCA GGGTTAATGG ACACAGTTGT TCATTGTCTC CAGATCCCAT 

54 4 51 ACTCCATGTA CTCAAAGACA CTTTCAGGAG ATGACTCACT CTTCCCTGGG 

54501 CCCNNNNNNN NNNNNNNNNN NNNNNTTTTC TTAGAAATGC TTTTTTTCAA 

54551 CCCACTTAGC AATCTTGTTG ACTGGTCCTT TGAACATTCT TTATTTTCCA 

54 601 AATCTGTCAC AAATGATCGA GTCAAAGATA TATAATAGTC AAATCTGACC 

54 651 CAATACTGAT CACGGGGGGG TTTCTTTTAA CATGTTTTGT AATCTAAATC 

547 01 ATTTCTCTTT GTTGCAAAGT AGAGATTTAT GGGCAGCTAT TTAATCTCCA 

54751 GATGTTTTTT CCTCAAANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

54 801 NNNNNNNNNN NNNTGGAATT TTATATTGGA TTTAATTTTT TTCATTCTAA 

54 8 51 GTAATCCTTC ATATGCTTAA CTAGCTAAAT TTTACTTGTT GGTTCTGATA 

54 901 TTCCTTCCCC AAATAATCAA AGTTAATTCA GATTATGGTT TTTTTTCTTA 

54 951 TCCTGATACA GCTTAATATA GCCATTGAAA ATATTCTCTA GAAAATTCTA 
55001 AGTCTAATAT ACTTCTAAAT TTTAAGCATG ACACTGAAAA ATAGTGAAAA 
55051 GCATAGAAAA GGAATATGAA AATAGCATTG AAACATTTGG AATTAAGGGG 
55101 TTTTAACCAC TACTAAAGAT AGAATATACT GAACCAGAAA ACCTAGTCTT 
55151 GATTTCTACC TCTATTTATG AGGTACATGA CTTTGGACTA GT AT GAGTNN 
55201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
55251 NNNNNNNNNN NNNNNNNNNN NNNNNNTCCC AGCTATCCTT TCTCACTGAG 
55301 TAGCTTCAAA GATCAGAGAT AAGGGATCTA GTGTTTGTGA ATTCTGAGTT 
55351 ACTATGAAAT GTAAAAGATT AATGTGTTTT GTTTATCACT GTAAATAACC 
55401 TTGATGGGGA GAACTTATGT GATTTAAGGT TGCAACTTTT TTTGTGTGAT 
55451 GTGAAATATT AAACATATTT TTAAAATAAN NNNNNNNNNN NNNNNNNNNN 
55501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
55551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
55601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

55 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
55701 NNNNNNNNNN NNNNNNNNNN AAAAACAATT TTAGCTTGAG GGACCCTAGG 
55751 TTTTTTTAGG TTATATGACT AGAAATATAA TAAGATGTAT TTACAGGGCT 
558 01 AATCCGGTGG GCATGATTTG AGCAACACTA AAGTTATCTG ATGTAGCGTA 
55851 AAATTCTTGT TGTTTCATTA ATGAGACAGA CTTGTGCTTC TCAAAGTGTG 
55901 TTTTATAGAG CATTATTTCT GTGAAATATT ACACAGTTGT ATGTGTTAAA 
55951 AAGTTCTTAG GTCAAATACA TTTAAGAACT CCTAGGTTTA GCTAAGTTAA 
5 6001 ATAGGTTTCT TAACTGTAGG ACTTCTCAAT ACTCCAATGT GCTAACAATA 
56051 TTGCAAATCT CTAAGAGAGG CATATAATGT ATCTTTTCCC AAGTTGTTTT 
5 6101 AACCATGGGA TTATTTGAAT CTTGGGATTA GAGTTTACTG GAATCAACTT 
56151 TAGGTCAGGC TGGGCTAAAC TAGGAAGTAT GGAGACCATA CTCCACAAAA 
56201 CCCAGTATCC CTCACTAGTT AGCTCTCGTT GCCTGGTTTT TGAGACTAAT 
5 6251 GGGTTGATCT TAGATTTAGG AATGAGGGTT GGGGAGGGCA GAAAGGTGCT 
5 6301 GCACAAACTT AG C C AGAAAA GCAGCCATAA GTGGTGGTAT GCCTGTATGG 
56351 ACACCATAGG TTTGCTGCAG GGGATTGTTA AAAAGCCACT TTTCAAAGAC 
56401 AAAACTCTGG TGTCACAGGG GTAGGAGCAA TCCCAGGGCA AAGAAACTCA 
56451 GAAAAACTTT CTGCAGAACC CAAAGTAAAG GAAGTTAGGG GGACATGCTT 
5 6501 TCTGTTCTAG CAATATATAA TGAAATCTTA TATAACTATT AAACATGTTT 
56551 TATGGAAAAT ATTTAATAAT TTTGGAAAAT ACCCATGTTA CAAAAATGTT 
56601 ACCTGAAAAG TAGAAAGCTG TACCTGCCAT TTGACTCCAA TTAGATATAA 
5 6651 AATACATGTC TGCATATGTC TGGATGTGTG TAAAAGTCTG AAAGAAAAAA 
56701 GACCAAAATA GTAACCATGA GTATATCCTG TTTATGATAT TTTGGGCGAT 
5 6751 TTTCATTGTC TTTTTGTTGT CTTTATCCAG TAATTTTTAA ACTGTTCAAC 
56801 TGTACTATCG TTGATAACAG TAATTCATTA TATCATTAAT TCCTTCAGAG 
56851 AAGGGGCAGA GTAACAGAAC CCCACTCTTC TCAGCAGGAA AGTGAGGATT 
5 6901 CTGATTCTTA ATCATGTTTA TTTTATTTTG GACATACTGA GTTTTAAAAA 
56951 TTCATTGAAG GTTTTATTAA CTTCTATTTT TAACCTTTCA GATTTTCCTT 
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57 001 TACAATTGAA TGTGATAGAT 
57051 ACGAGAGGGA ATATAACATC 
57101 CAGTGTTGTC AGAAGCCCAT 
57151 ACTAAAATAA CATAGGATGA 
572 01 TGACACATTT ACTAAATCAG 
57251 ATCNNNNNNN NNNNNNNNNN 
57 3 01 NNNNNNNNNN NNNNNNNNNN 
57351 NNNNNNNNNN NNNNNNNNNN 
57 4 01 NNNNNNNNNN NNNNNNNNNN 
57451 NNNNNNTCTG AGTAGGCACA 
57501 ACCAGCGGTG AGCTGGATGT 
57551 CCTGACAACC TGCACCTGTC 
57 601 GCAACACATG GTTANNNNNN 
57 651 NNNNNNNNNN NNNNNNNNNN 
57 7 01 NNNNNNNNNN NNNNNNNNNN 
577 51 NNNNNNNNNN NNNNNNNNNN 
57 801 NNNNNNNNNN NNNNNNNNNN 
57 851 NNNNNNNNNN NNNNNNNNNN 
57 901 GCATAATGGC AGCTCCAACC 

57 951 AC TAG CC AAA TGTAGGCTAT 
58001 GTCCACCTGA CTCATGTAAA 
58051 ATTATAGAAC ATAATGTTAT 
58101 CATATCAGAG AACCAGCCTC 
58151 GTGCTGCAAT TCCAAGTACC 
582 01 CTGATTGCTG CATGTTGTCC 
58251 GATTTTACAG AGTCTGTGGC 
58301 AATATTATGG TCTTTCTGAT 
58351 CTGCTGAGGG GTGAAAAGGT 
584 01 TAAGACCTGC AGAGACCCTG 
584 51 AATTTCTGCT AGTGTTTTGA 
58501 GATCTGCTGC TGTGAATTCT 
58551 TAATTAGATC CTATGGTAGC 

58 601 GTATAAATGG GCCCTTGGTG 
5 8651 TTTTGAAGTT ACATGCACAT 
58701 GTCCTCANNN NNNNNNNNNN 
58751 NNNNNNNNNN NNNNNNNNNN 
58 801 NNNNNNNNNN NNNNNNNNNN 
58851 NNNNNNNNNN NNNNNNNNNN 
58 901 TATTGATGTT GAATTCATTT 
58 951 TCATAAAAAG ACAGCCTATG 
59001 TCATGCTTTT TATTATGGTT 
59051 CTAGATCCTT TTGAAACAAA 
5 9101 CAGAGAAAAA TGAAAATAGA 
5 9151 AAATATTTAG AATACTGTGT 
59201 AGTTGTGATT TTAAAATAGA 
59251 AAATAATGAC AATAGTCTTT 
5 9301 TACATGATTG AT CT GAT GAG 
5 9351 NNNNNNNNNN NNNNNNNNNN 
59401 CCCAGAAAAT CTGATTTAGT 
594 51 TCATAAATAA ACATTCTATG 
5 9501 GACTTCCTAT GGGATAAAAA 
59551 GGCTTGAGTG CATTTACACC 
59601 CCCTAGACGG TGTTCATGCT 
5 9 651 GTCTTCTTAC CACTTGCATT 

597 01 CTAACCTTCA TGGGAACCAT 
59751 NNNNNNNNNN NNNNNNNNNN 

598 01 NNNNNNNNNN NNNNNNNNNN 
59851 NNNNNNNNNN NNNNNNNNNN 
59901 NNNNNNNNNN NNNNNNNNNN 
5 9 951 NNNNNNNNNN NNNNNNNNNN 



ATGTCCAATT TTAGCTAGGA AATTTGTATT 
ACTTGGACCA GGAAACAGAT TTTTTTTCCC 
TTTAACTAAA ATAACATAGG ATGTTTATTA 
TTATCTTAAG ATGTAGATTC CTAATCCTTA 
ACTTACTAGG GAAAGTATCT CTCAGTCTTT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
CTTAGTCCCC AACACTCAAC CTTTTATTCA 
GAACATGACA GACCCAGTAG GGTTCCAATG 
AGGAAGAGCC CCCTCCTTTC CTGCTCCCCT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN ATAGCAGGGC TCTTTCTAAA 
CAGCAGAACC ATATTCTGGA TGGAGTCAGA 
TTAAAATATA GCAATTGTGC CTAGGCATGT 
ATAAAAAGAA AAGGCGTTAA TAGAAGCCAG 
TAATATGTAT CAGGTGCTCT TATGAGTGGC 
ATTGTTGCTG TTATTACGCA ACATGAGACT 
AAGCAGTAAG AAGAAAATTG TTTTCTTTTA 
TGAGGTTTTT CCCCTCATCT CTCTTCTACA 
TAGTTCAGTC ACATTGTTCT TAGTCATGGA 
TCTGTGTAGT G AT AAG T AAA AAGATTGTTT 
CTTCAAAGTA GTTTGCTTTC TTGTAAACAG 
AGAAGATGCC TGATATCTCC TTGAAAATTA 
AGGAGCGAAT TGTCACTTCT CACAGGTTAG 
GAAAGTTTTC AAGATTTTTG ATTTATATTT 
TAGTAAAGAA AATATCATTC TCTCCAAAAC 
ATTGTGACCA AAGTCAGAAG TCAGATTTTT 
TTGTCAATAG AGTTATAATT TACAAGTATT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNCTAC 
ATATAGTAAT GATAACATTT CCTACTTAAT 
CTGTTTTCTT GTTCTGAGTT TATATGTTTC 
CATTACAATT TTAATGTTAT TTTTAACTAA 
TTGGTTTGCA AGTGTGAGCT GTTAGGTGCA 
AACTTGCGAT TTTATTCTAG GCTTGTTACC 
TTTATTTAGG TGTTTATAGT CTCATTAGAC 
GACCACATCA TCTCAACTTC TTTACTGTGA 
TCAGAGATGA ATCTGTCTAG ATGGGAAATT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNAAA ATAATCTGAC AAGTAGTTTC 
AAATGTACCA AAAGGATTTA GAAATCTACA 
TTATTTTAGT TCAGACCCTA TTTTAATTCA 
CTTCCATTCT TTCTTTAAAT AGATTCTTTT 
TGTCCCCAAC AGCTGGTGGG CTTCTGCTCA 
GCACTCAGTC CAAGCAGCCC TTATCAGAGA 
CTGGTGCGAA GGACTCATTC CTGGCAGAGC 
TGGCTTAGAA GAGAGGAGAA GNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNTATCC 
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60001 


TGTCTCCGGG 


60051 


TACAATACAT 


60101 


TGATTAGCAA 


60151 


CCAATTTTGC 


60201 


ATTCTTCCTA 


60251 


GTCCAACAAC 


60301 


TTCTCTAACA 


60351 


TTTGCTTTTC 


60401 


TGTNNNNNNN 


60451 


TCATAAAACT 


60501 


GCCTCTAGGT 


60551 


CATATGTACC 


60601 


TTAAGACAGT 


60651 


TGGCCTGGGT 


60701 


AGAATTTGCA 


60751 


TACTCTTGGC 


60801 


ATTTACTTCT 


60851 


TTGACTATTT 


60901 


TCCCTGGGGA 


60951 


ACAAGAACTT 


61001 


GGCCCAGTAG 


61051 


GCCTTTATCC 


3 61101 


TAGTTTATGC 


13 61151 


TGGGTAAAAC 


f| 61201 


AATATATGAA 


612 51 


GACACCCACT 


■i 61301 


ATGGTCATTT 


M 61351 


TCAGAGGCCG 


4 61401 


CTTTAAAAGC 


1] 61451 


CCTTCCTCCT 


li 61501 


CAATTCTGAT 


61551 


CATAGTACTT 


! 61601 


TTATGACGTG 


^ 61651 


CTCCCGTACT 


;:i 61701 


GGAAATTGTT 


"I 61751 


NNNNNNNNNN 


61801 


NNNNNNNNNN 


61851 


GAAGGAGGCA 


3 61901 


CTTCTCTGAT 


61951 


AATTAATGGT 


62 001 


AATATATATA 


62051 


GGGGAAAGAA 


62101 


NNNNNNNNNN 


62151 


CTCCCTTGAA 


62201 


TTCATAGCCT 


62251 


AGTAACCAAA 


62301 


TAACAGAACC 


62351 


TTTATATTTT 


62401 


TGAACCAATA 


62451 


CAATATTTAG 


62501 


TTTTTTTGGT 


62551 


GCTTGAATGA 


62601 


CTCTAAAGTG 


62651 


GGGATGCTCT 


62701 


TTAAATAATT 


62751 


CAATAGCTTC 


62801 


GTGGATTGTA 


62851 


TCTTAGAAAC 


62901 


NNNCATCTGA 


62951 


ATCTTCTAAC 



CAGTGTCTCT GAGTCTTTCT 
AAAAGAGAAA TAGGTATGTC 
TGTCTATGCT CCACCTTTCA 
TCTTTACTGT CAGCCCAAAG 
TG GG AT G AAA GCTTTGACAA 
CCTTTTCTTT TGGCTTATAA 
TGGTTAAAGA AATTTCAGAA 
TCTCTGATCA CTTAAAATTG 
NNNNNNNNNN NNNNNNNNNN 
CTTGTCATTC ATCCCTACGC 
TTTGCCACCA GAAACAGCCT 
ATTCTCTTCA ATGAACTAAA 
GTTATTTTTA ATCATTTAGT 
AATGCCATAG GAACCACCTT 
AACCTACCTT TACTAAATAA 
AAGAAGTTGA GCAGTGGGAG 
CTGTGCTTAC TGTTAATGTT 
GGGAAACTTC ATTTGTGAAC 
AAATACATAT TTAGTAGAAA 
TCCTGCTCCT TTTTGTGGCT 
C CAT TAG AAA GTACTGTGCT 
AGCTGTCACT TGCTTCAGTA 
AGAAGGCTGT AAGAAGCAAC 
ATGGACCAAT GGTATAAAGA 
CATTCATGGT TGATGTTAAT 
CTCACCTTCT TCCTGTTCAT 
CAGCATGCAA ACCTGATCTT 
CCAGTTTCCA CCAGGGATAA 
TCCTGCACAC TCTGCTTCTA 
GCACCTTCTT GCTGCCTCCC 
GTTTCAGTCC CCTCTCTTTT 
CTTCTTATGT CATAGAGAAA 
TATCCTCCCC AGCCTAGCTT 
CACTAGCATC CTCTGCTTCT 
AAAAATCNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
GGATGCATGG ACAGATGGTG 
TCTTTTTTAG TTTCTAAATT 
TCAACTGCAG AGAATAGCAG 
TTGTGTATTT AAGTTGAATA 
CAATTAGAAA AATCATGTAC 
NNNNNNNTTT AATTAACATT 
ATCAAAGAAA AGTGTTTGAA 
TGCGTTCAAC AAGTATCAAG 
ATACAAATAA CAGACATATA 
CTCTTTTATT TAGATGTCTT 
TTCTTTTTAC AATCCTAAAA 
CAAATGAAAA AAAAAATTTC 
CATATTGGCA TATTGGTGTA 
TTGTGTTTCT TGTACATGAT 
TTTGTTCTGC TACTTATTTC 
TATCGGGGGT GGGTAGTTTT 
TAGTAATCCC ATACTGGTAT 
TGGTATGGTT ATAGAGTAGA 
TGAGCACTTC TGGTCTCTAA 
ATGAAGGTAT ACATGATCAT 
CAAGAACCNN NNNNNNNNNN 
CATTTTACAA GTTGTTACTT 
TNNNNNNNNN NNNNNNNNNN 



AAAACTTTAT TAAAAGTTCC 
ACAAAAATTG GGCTCATTGT 
CCAGAGATTT AACTTTTTTG 
GTGTCTGTGG AACTTTGTAG 
ACAAGGTCTT ATTTCTGTGA 
TGAATAGATG GCTAAAACTT 
ACTAATTTCC AGCCCTTTTA 
TGATGCATAT GTCCTGCATC 
NNNNNNNNNN NNNNNNCATC 
TCCAATCTGT GTCTCTAAAA 
TCAGAATATT GGAATAATTA 
CTCCTAAGCA TAGAAGTAAT 
CCAACAACAA ATGCCAGAAC 
GTTTCCTATG AAACAGGAAG 
TGCTTACATT TTGCTATAGT 
GTGTATATGA TAATATTTAC 
TTCTGGGTAA AAATATGCAA 
ACAGGTTATA TAT GAT C AC A 
GTGCCTGGCC AGCATTTCTT 
GATTTTACCC CTGACTCCAG 
CCTCAGCCCA GTCCAACCAT 
CCTGTACATT CCCTTTCTTT 
AGGCAAGACA TATCTTTTGG 
TTCTGGGGAC ATTTTCTAAA 
TTTAGTCATA ATTCCCCTAT 
TCTCCAAGCT ACTACCAGTA 
GGCAATCCTC TGTTAATCCT 
CGTCCAAAAT ATTTACAAGG 
TCTCCAATTC CATTTGATGC 
TCCCCCTGTA AGGTCTACAG 
CATTCTCCAG AGCCATAGCT 
GCATCACTTC TCAGAGATTT 
AGCCAGTCCC TCACTCCCCA 
TTAAAGCACA CATTACCTGG 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNAATGAGT GGATGGAGTA 
GAGGAATTGT ATTGCTAGCA 
TATACCATTC TAATTCTAAG 
GTATGCATTT GTTTTTATGA 
TTAGCCCAAA GTATCAAGCA 
CAATGTCTAT TCCTGGGNNN 
TTCTGCATTG ATCAAAATAG 
TTTCACAAAC ATATTTCAAT 
CCTATTCTAA GTTCTCTTTG 
ATTGCTATTT ATATGTGATC 
AGGTAAGTTT TTTATTCATA 
ACATAGAATT ACAAAAAAAG 
ATTTCTAATA CTTAGATTAC 
TTTATCCCTC TTTAGTCCTT 
TGTGATAATA ACATAATACT 
ACTTAATATT TTCACATAAA 
TTTTCTCTTG AAACTTCAGT 
GTGTGAGGAA GGAAAAT TAG 
GCACACAAAA TTGTAAGAAC 
ATTCCTTGGA ATGTTTCCCA 
CTGCTGCTAA ATTAAATGGT 
NNNNNNNNNN NNNNNNNNNN 
GACTAATTCT TTGTGTTGCC 
NNNNNNNNNN NNNNNNNNNN 
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63001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63201 NNNNNNNACA GGCTCCCGGT GCCAGGAAAC CTTTACATCA ACACTCGATT 
63251 TGCCATTTGA TAGTCCTTCA TCTGGGAGGA AAAAAAAAGA CGGAGGGGAG 
63301 CTTGAAAAAC TGTCATAATG TCCCTGGAAT ATGGTACTTT TAAGAGTTGA 
63351 GCCTATTCCA TTTTGGAGAT GATTTATATA AGTTACAACA AAAGAAGGGG 

63 401 ACAAAAACAT GATTGTTCTA TGGAGTTTTT ATAACTTTCT GTCACAAGAA 
634 51 AGCACGCTTG TCTACAATTT TGTAATATTT CTAGTAAATA AAAGAGGCAC 
63501 TCCCCGTCTC AGAGCACCAA ATAAGGAAAG TGTAATTGGA TGTCATTGCT 
63551 GTCAGTCAGC TGGGCTATAA AAGAGAGAGT GGGGTTGCCT CATCCCCTGG 
63601 GTATCCACAG TCAGCTGTGT CCCTAGAGCT TCTTTTCTTT CATTGCTGCC 
63651 CAGCTGGGTA TATTGCAAGT AT GG AT TATA AGAGGGGAAG GGACTTCACT 
637 01 GTTTTAACGT TTGAAACAAA AAGGAAAAAA CTCAGAAGTA GTAAGCTAAA 
63751 AACAACTTGT GCAAACGTTC TGGGATTATT ACTTAATTTT AAGAATTTTT 
63801 GCTAAAAACA ATAGGAACAT CGTTGAAAAA AGAACCCCTT TGAGTGATGA 
63851 CTGTTTCATA GTGTTCTACA TCTCCTACTC CCTGCCTTAT AAAAATAAAG 
63901 CTAATTAATT GAAGTTCTGG CAAAGAGAAG GGAGTATTCC TTGGCCTTTG 
63951 ACCANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

64 001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 201 NNNNNNNNNN NNNNNNNNNN NNAATGATTT AAAGAAAT C A ATTATTTTTT 
64251 TAAAGTAGAT TCTTATGCTT TGTCCCACTT GTGTCCTTTG AGGAAAAGTG 
64301 TGAAATACTA CCTGGGGTTC TAACAGGATC TTTGNNNNNN NNNNNNNNNN 
64 3 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA 
64 4 51 CACATGTGTG TAAATAGCTG CAATTCTTTA TGAAAGTGTG TAATAGGGGC 
64 501 TTATAAGCAT GAACTCTTGA GGTACAAGAG GGAAGAATTT GATTCTGCCT 
64 551 GGGGCAGAAT CACTCCACAG AGGCCGTGAT ATTTATGCTG GGCCTTGAAG 
64 601 GAAGAAGAGA TTTTTGGCAG GTTAATTAGC AGGAAAGAGA GCATTGGGTC 
64 651 TGAGACGTGA AAATGAATGG CATGTTTGGA CAGCATTGTG TATTTTGGTG 
64 7 01 GGGTAGATTG TAAACTCTGG CCCAAGAAGT TTGAGTTTGA TCTTAAAGGA 
64751 GCCAGGGAAA CTGTTTAAGC AAGGGAGGCC ATGGTAGGAA ATGTATATTA 
64 801 GAGTGCAACA ATGTGCGGAG TTCACTGGGG GTGGGGAGTA CATGCTGTGA 
64851 TTAGAAACCA AACCATTCGC TTCCATCATA GGTATTTTCA GTTAAAACAA 
64 901 CCTTGATTTA AAGATCATAA TAATGTATCT TTCGTAGAGA TGAACTTTAG 

64 951 AATTTCATAG AGATGAATAG TTGGCTAGAA ATTGGTTGCT CTATTAAAAT 
65001 GTATTTTCCA GATAAAGACT TTTTAAGTTA TTTTTGTTGA ATTTTAGAAA 
65051 CTTTTGAAAA TCATTTTTAA AAAATGTACC CAGATCAAGG TTGCAGTATA 
65101 AATCACTGCA GAACAAAATT CGTGAATCTT CAT GT GAT AT ATAATTAAAT 
65151 ATTTTGGATT CATCCAGATG AATAAAATGA CTAGGCCTTT TCCTCCTGCC 
652 01 CTTGTTAAAG GTAATGCCTC TTTATCAATA CTTCACACAC ACACACACAC 
65251 AAAAACTCAA AACCTTTCTC TTGCAGTTCC ACGGCAACCT TCAAATGACT 
65301 TGTTTGAAAT ATTTGAAATA GAAAGAGGAG TCAGTGCAGA CGATGAAGCA 
65351 AAGGATGATC CAGGTGACAC TTACACTCTT AGTGCCATCA TAATGACATT 
65401 TTATTGTTTC TCTCCATGAA AATAACTTTT AAATGTAGAT CGTTTGGACC 

65 4 51 ATTTGGGAAA ATTACACCTG TTTTCTTTAA CAACTCAGAT TTTCTTTGTA 
65501 GTTAAATCAG TTGAGCNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
65551 NNNNNNNNNN TAGTACTTAA ACTGTAAAAA GGAAATCATA TTTTATAAGT 
65601 GATGATGTAA CATTGTTATA ATTTGGCAAC ACTTGTGTGT TCTATTTTTA 
65 651 C AG AT GAT GC ACTATTGTGC T G AGAAAAG A TTAATGTAAT TTCCCTTTGG 
65701 TCTTATGGAA AGAAGCTCTG GTTATATGGA TAGGAGAGAA CACTTGTTGT 

657 51 CTAGATACCC CTAGTTCATA GATTCATTTG TTGTTTTTTG TTGATTTTCT 

658 01 GATCCACTCA TCTTTCTGAT TATTTCCTTG TAGGTGTTCT GGTACACAGT 
65851 TGTAATTTTG ACCATGGACT TTGTGGATGG ATCAGGGAGA AAGACAATGA 
65901 CTTGCACTGG GAACCAATCA GGGACCCAGC AGGTAAAACC ATTTCATTTA 
65951 ACTTTTTCTG GTACACATTT CAATGTGATA CTATCTGAAG ACTCCACTGC 
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66001 TGCTAATCAA GTCTACTGTA 
66051 GTCAAAAAGG ACCATGCCTA 
6 6101 AAATTTGTTA GCTATCAACT 
66151 TTAAAAGGTC AACTAATTTC 
66201 GATTAATTCT TGCAAATAAT 
66251 GAATAAAAGA AGTAATATAG 
66301 CCAGTTGNNN NNNNNNNNNN 
66351 TTGAGTGGTA GTAGAAGAAC 
664 01 TCCTCTGAAA AAATTTCACT 
66451 AATCACCACC ACTCATTTTA 
66501 TGGAACTCAA AGGAGTTTTA 
66551 TAGTACTCAT AAGACATCTT 
66601 CTCAGTAGTT AAATCNNNNN 
66651 NNNNNNNNNN NNNNNNNNNN 
66701 NNNNNNNNNN NNNNNNNNNN 
66751 NNNNNNNNNN NNNNNNNNNN 
66801 NNNNNNNNNN NNNNNNNNNN 
66851 NNNNNNNNNN NNNNNNNNNN 
66901 NNNNNNNNNN NNNNNNGAAG 
66951 TGGAAGGGTG AGGAGTTTGG 
67 001 TTCTAGATTT TAAGAAGATA 
67051 TTTATTTTAA AAGCCTAGTT 
67101 ACACAGAAAA GTTAACTTGA 
67151 NNNNNNNNNN NNNNNNNNNN 
672 01 NNNNNTCATG CATTTCTAGC 
67251 CAACGCACAT TTTGAGTAAC 
67301 NNNNNNNNNN NNNNNNNNNN 
67351 NNNNNNNNNN NNNNATTTAC 
67 401 AAAATAGGAA TATTCAATAA 
67451 TTGTAAAAGA CAATATCATT 
67 501 TCTTGCATTA CTGTCTTACT 
67 551 TTATCATTAA TGTCACAGAT 
67 601 ACNNNNNNNN NNNNNNNNNN 
67 651 CCTAGCTTAT TTAAAATCTC 
677 01 TAAATTTATT TCATTGCTAC 
677 51 AGTATTTGAA AACCATCCCC 
67801 CTCTACCAAT CTTATAGGTG 
67851 TTTCTTCTAT TGTGAAAAGT 
67 901 AAANNNNNNN NNNNNNNNNN 

67 951 NNNNNNNNNN NNNNNNNNNN 

68 001 NNNNCTCTGT ACTACTCATG 
68051 ATCTGCAGCA TTATTGTGTT 
68101 GTGTGCATGT GGTAGTGTGC 
68151 GATTAGACTG ACTGACCACT 
68201 ATCTTTGGCA GTCATGTATG 
68251 ATGAATGATT CCTTCAAGTA 

683 01 ACTCAGGAGA ATTCACTGCA 
68351 TGAGTTTTAG AATTTAAATA 

684 01 AACATTTTTT TATTATTCAT 
68451 CCCAAATCTC CCTTTCTCTC 
68501 TAGAATGGTA GACAAAGAAA 
68551 AGAGCAAATT GAAAGGAAGA 
68601 GATGATTGGC TACGGTCCCT 
68 651 TTTTACCAAG AACTTAGATA 
68701 AACTGACCAA ACTACGGTTG 
68751 GTGCTGATAC AACCAGCATT 
68801 GAAGGTGAAA TGTAATGGGG 
68851 AACCAGTCAC ACATCTGAGT 
68 901 TAGAAGACTT TAGAGTTTTA 
68951 TGTGATGTTG CTGCCAGAAA 



ACACATCTCT GTGTTTACTT GATAAAGATG 
GCATTTATAT TTCATTTCTT TGAAAGAGTT 
AGCACTATAT TATGGGAGAC AAGTAGTTAA 
TCTCAAAAAG CTTGATAAAT AAAATATTTA 
TCTTAAGTTA TTTGTAGCAC CCATTCCCAG 
TGGTATATCA GCCATGGTAA TAGGCATTCC 
NNNNNNNNNN NNNNNNNNNN NNNNNNAGAT 
ACAGTGGAAA TTTCTATAAA GTTGTGATTA 
CTAACTTGGG TGCTGAAGGA AATGGTTACG 
TAACAAAAGG TAGATTTAGA CATAGTTCCT 
AATTGCACAA TTCCTATTAA TCTAAAGTTT 
TTCTTCTAGA TTCAAGATAG CTTTTCTCCC 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TTAAATCTTT TATGGATAAG GCCTACTTAA 
TTTCATTTTT TTTCCTTTTT ATACTCCTGA 
CAGAATGTGA AGGAAAAGCT TGTTATTTCA 
CTTGAATACC ATACTGGATA AGAGCACCAA 
GAGAGCTCAT GATATCTTAA ACCAGCTCNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
AGGCTCCCTG AAGATCTCTA TGGTGGTCCT 
AAGGTATTAA AAAATTAGTN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TTAAAAATTA AAATGTGAAT TAAATTCCTT 
ATTCCAAATT GGTTCATACA TTTCTTTTCA 
TAGTGCCTCA CACACAGTAC TCTCAGTCAT 
CATTTAAAGT TAACTCTGAG CAGGACAGCA 
TCTGTGAATA AGAGAGAAAC CAACATATCT 
NNNNNTACAA TATAAAATAC CTATGACAGT 
ATCAATTTTA TTTAATTGTA GCAATTAAAA 
AGAGTTGTAG CAATTTACAC TCCTACCAAC 
AACAGATATT ATAAAAGTAT TTTTTTAAAT 
AAAAGGAATC TCTTTTAATT TTACTTTGCA 
TGATCATGTT TTATCATTCA CTCGAGGAGC 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
CAGGCTCCTA G AC AG T C C AA TACTCAGAAC 
TTTGACTGTA CCACAGGCTG GCCTCAGTAG 
ATGATCTGAT TAGCAGATCA CAGACCTGTA 
GACCTGGAGT AGTTTGCTAC AGAAGGAGGG 
ACAGAATGGT GAATAATATT TATGGTCATA 
AGTTACACTG GGGAGAGTAA TTCATGCTAA 
CTTCTATGTA AACAATCTAT GTAAAATACT 
CTGTATTTTA GAACCAATTT AGTCCTTTTC 
TATATTTGTA AATGTTTAAA TTTGTTGTGA 
TGATGGAAAA ATGAGAAGAG GGTTAGAAAT 
TTTGAAATCG TTATTGGACC GTGTTAGTAC 
GATGCCTGCT TGAGTGATTG GATAACTCGT 
GGAGGATGCT TTTCTAGAAT TGGTCCAAGT 
AATAGGACAT TGAACTTAAT ATTGAAGATA 
ACTCAAATCT GGAGAGGAAA GTGACTATGT 
CGAAAAATTA GATTAATGGG TCACAATTAA 
TTGAGTCGAA GTTATGGCTC ATAGTCGAGA 
GAGGGGGAAG ACCTGACATA TCACTAAGGG 
GTGTGTAATG AAATCAATCT AAGCGAACCT 
TAT GG AC CAT CTCTATAGAA ATATGCTATG 
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69001 


TTTCAAGAAT 




69051 


ATTCTTTTTT 




69101 


TGACCTGCAT 




69151 


CCATAACGTT 




69201 


ATAATTGACT 




69251 


TAGGATTGAT 




69301 


TCTAAAAAGA 




69351 


GCATTTGACT 




69401 


TCAGATGAAA 




69451 


TTCAGAATGG 




69501 


TAGCTTGGAA 




69551 


GCAGGATATG 




69601 


GAGAATTATA 




69651 


TCATGGTTTT 




69701 


NNNNNNNNNN 




69751 


ACATGTGAAG 




69801 


NNNNNNNNNN 




69851 


NNNNNNNNNN 




69901 


NNNNNNNNNN 




69951 


NNNNNNNNNN 




70001 


NNNNNNNNNN 




70051 


NNNNNNNNNN 


CI 


70101 


CTAATTACCA 




70151 


ATAAAAGAAT 




70201 


GCAGTTCTAA 


>-m 


70251 


TTCTAAGAAA 


\l 


70301 


AGTGCATTTG 




70351 


AGTCACAGTG 




70401 


TGCTGCTGAC 




70451 


GGGACACACC 




70501 


CCTGGAACCT 


% 4 


70551 


ACTCTGTCTG 




70601 


GTATGCCAGC 




70651 


TTTTAAAATA 




70701 


TTCTACCACC 


70751 


TCTTGATTAT 




70801 


CTGTGCCTTG 


'%\ 


70851 


NNNNNNNNNN 




70901 


NNNNNNNNNN 




70951 


CTATCCACTA 




71001 


NNNNNNNNNN 




71051 


TACATACAGA 




71101 


NNNNNNNNNN 




71151 


NNNNNNNNNN 




71201 


NNNNNNNNNN 




71251 


NNNNNNNNNN 




71301 


NNNNNNNNNN 




71351 


NNNNNNNNNN 




71401 


NNNNNNNNNN 




71451 


NNNNNNNNNN 




71501 


TTTTTCTCTG 




71551 


AAGTTTTCTA 




71601 


ATTTAAAAAC 




71651 


GCAAATTTGG 




71701 


ATTGGCATAC 




71751 


GTTATACAAG 




71801 


AAAAAAAGTT 




71851 


AGTTCAGAAG 




71901 


AATTTAAAGG 




71951 


TTATAACCCA 



GAAAATCCAC TTTGGTTAAG 
TCTAATATGA GAAGAATAGA 
GGAAAGGTAT CTCCAAGCTT 
TAGCCTGGAG AAGAAATGAT 
AGGGTTAGAG TTGTTCTCTG 
GGGTAAAAAA TACAGGGAGA 
GAACAGGCTG ACATGGAAAG 
GGGTGACATA TGGCAAGATA 
GTTGGATTAG AAAACCTCTA 
AATGTGGAAA TAGTCCCAAG 
AGAAGGATGG GAAGACCAAA 
TGAGGATGGA ATTGCAGAAT 
ACCCAGCACT GATGGGAAAA 
TTATTCTAGG AAATTCAGGT 
NNNNNNNNNN NNNNNNNNNN 
GCCAGAGAAG AATATAAAAA 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
AGCTAATTAT TTACCTATTT 
ATGTGTTTGG ATATCCAATT 
TCCAAGATCC AGATTTGTAA 
ACTCTCAAAA CTCAAAACCT 
CTAGCTGCCT TTCATGGTGC 
AGATGTGAAA TTAACAATGC 
AGACATGGTG ATGCGCTGCA 
CTTCTCCTGT GTGTGTCAAA 
TGATCTCTGG AGAACAAGGA 
CCACTGATAG CTACAAGGAA 
ATGTACATAC AAGTATGTCT 
CACATAAATG TAAAATAAAT 
CCTGACTCCC CATTCACCAC 
GTTCTCAGAG AGATATTTTT 
CCTTTGTCAC TTATCGATAC 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNGC 
ACACCNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
ATGTTCTTTA TTTTTCTTTA 
TACAAAAAGA AAAATTATCC 
ATTTTTGTTT GTCTATTTTA 
AAACTTAAAT GTTCTTCAGT 
TTGCAGCAAT GCAAAGTGTA 
AAGCAACTTA CAACACTGCA 
CATATTCATG TGCATTTATG 
TCAAAACCTT CACAAATACC 
CACCAACCAC TACAGAATTA 
ACTTTCTGGT CAAGTTTTGC 



TGGTGAATCC ACTTTTGAGA 
GTTGGTCACT TAGAAGAGGA 
GTGAAAAACA GGCAAAGGAA 
GCATCAGGGT TCCTTTTCAA 
TGACCGAAAG GGGATGAGAA 
TTCATTTTGG CTCACTGTCA 
AGTGAGTTAT CTTGGTGGAA 
CTGTTAAAAG GTTTTGAGCA 
AACCAGCTCT GAGAGCCTGA 
AAATTCAGTG AGCAGGATTC 
AGATGGCAGA GAGTTGGAAG 
ATATTTTTAG AATATGTTGA 
TACTAATAAA GCTTACCTTT 
GGATGGANNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNTC 
ATCNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
AATAATTTCT GTGTATTATT 
C TAT C TAG AT TATACCACTT 
AT G GAT GAT T TTAATGATGT 
ATATTCTTAT ATGCTAATAT 
TAAGAAATAG CTGCAAAATA 
TATTAGGTTT TATCACATTC 
CTCTAAGAAA TGGAAAATTG 
GGTATGTAGC TGTGGCCCTG 
GCTAGAACTA AGGCCCTTTT 
TTAAGGGCAA CTGACTCAGC 
TTTGCTTACA TATGTATCAA 
ATTTCAGCAT TAAAACATTA 
CAACAATATA AGTGAAAGCT 
CCAGCAAGAT GCAGCCAGTT 
GGATACAATG GCATTCTCTT 
TACCTTAAGA GTTAACAGGN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNTGGGG ATAATAAGTA 
NNNNNNNNNN NNNNNNNNNN 
TGAAGCATGT CCCCTGTCAG 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNTCAT 
GTATTTTTAT TGATTTATGG 
TTTTTTAATT TGTTGCAAAT 
TGGTATTTTT ATCTTAACAT 
AAGAAATTGG CTAAATGGTT 
TTTGCTACCA TAGAAAAATG 
AATATAGTAT TCCTTTTTTT 
TGAATGTATT ACATAAGAAA 
TCTAGATTTT GTGAATTGAA 
AAAATAAATA AACTAACATT 
AGCATCTATT ATTTACCCTC 
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72001 
72051 
72101 
72151 
72201 
72251 
72301 
72351 
72401 
72451 
72501 
72551 
72601 
72651 
72701 
72751 
72801 
72851 
72901 
72951 
73001 
73051 
73101 
73151 
73201 
73251 
73301 
73351 
73401 
73451 
73501 
73551 
73601 
73651 
73701 
73751 
73801 
73851 
73901 
73951 
74001 
74051 
74101 
74151 
74201 
74251 
74301 
74351 
74401 
74451 
74501 
74551 
74601 
74651 
74701 
74751 
74801 
74851 
74901 
74951 



CCTCCCCAAA 
TAATATAATT 
TTTAATAGGT 
TTCCAACTTG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTGTGATGAT 
CCAAAATATT 
GAGAAAGATT 
TCAAGGATCT 
CGTTGATTCA 
AGAGGTTTGG 
CTGGGGAGCC 
ACCTGTGACC 
TAAAAAGTTA 
AAAGAGATGA 
CCAAAATAGA 
CCAGTGTTGA 
TGAGGGAATG 
CTTTCTCTCT 
TTTTTTTTCT 
GCCCTAGGGA 
TTTTATTAAT 
TAGATATGAT 
CTCCAAATAA 
CTCTGTATCT 
ACTCTGATTT 
TAAGTTGGGG 
TCTGCTTTAC 
AATTTAGGGT 
GGACAACGTT 
ATGTTGTCGT 
AGTGTTTGGT 
ATGAGCAAAG 
GACAGGCATA 
CTTTACAAAG 
GCTTGTTCTA 
ATTCCAGGTG 
AAAAGCTGCA 
ACCTGTGCCT 
CTCCAGGTGT 
AAGAAATGGT 
CTGACATCAA 
TTTTCCTTTC 
GTGTGAATTC 
ACTCCATTCA 
TTCATTGCAA 
TTTTTTTTTA 
TTTTTTAAAG 
ATTGTTCCTG 
CT AC CAT AAA 
CTTTCAAGTT 
GTAGGGTGGG 
CAGGATGTGG 
TCAAGACATA 
GAATGCATTT 
TGTTGGGAGA 



ATTCTTGAGA 
GTACATTTAT 
AACTTGCTTG 
TATCATTTCC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAAAAGCTG 
GTTTGGATTA 
CTTAAGACAT 
ATTTCCCTGC 
AACATCCTTA 
GGGAAACTTT 
CTTAGCACAG 
TATAGAGCTC 
ATACCAAATT 
TAGTTTATAT 
TACACATGTT 
AACATACAGT 
TCATTTGTAC 
CTTGAGAAGC 
AATGCAGTTT 
TCTGAGAACA 
ATTTGTGTTC 
ATAGATTTGT 
AATTGAAGCT 
TCCTGGTGTT 
TTTTTTTAAA 
GAG GAG AG AA 
TGTTCAGAGG 
CAGTTTTACC 
ATCATAAACT 
ATGCCTATAT 
TACTAATTAT 
TAAGTGAATG 
CAAATAAGTC 
AAAAGAAAAA 
GAAAGGGATA 
GACAATATCT 
CGCTTGGTGC 
GTCATTCAGG 
TTGTGAGAAA 
GGCCATGGCT 
GAGCGTAAGT 
ATAGGAGAAC 
AGGCTCAGAT 
GTGTCTCTTT 
CCTGTGATCA 
ATGGAAAAAA 
TTGATCAACA 
CTTTCCCTTT 
TTACCCAGGA 
CAAATTTTTT 
AACTCAGAAC 
GAAGTGCTTT 
GGTAGCAAAA 
GGGAAAATGT 
TACCTATGCC 



TATTGTGATA 
CACCAGAATT 
ACACTGAATC 
TTTATTACAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACTTATAGTG 
GTTTGTGGAG 
TATGTATTTT 
AGTTTTTTGC 
AACACCTGCT 
GGTTCAGACT 
AGGCTGGCAC 
CGACGGGACT 
TTATCATGTC 
CTGTTAACTG 
CAACATTCCA 
GTGTTTTCAG 
ATGGATCAAG 
AGCACCCCCC 
CCTTAACAAC 
TCATCAAAGT 
AT GT AT AT AT 
TTTACTATAA 
CCAGGAAGAC 
AGATTTACAC 
GAACCTTTAC 
TAAGCAGATA 
TTAATTATCC 
AAACCACCAA 
GGTGAGGGTC 
GTCTGATCAT 
TGTGTGTCTC 
TTTTCTGTAA 
ATATTTTTCT 
AAAAAAAAAA 
TTGGCACATA 
GACAGTGTCG 
TACCTCTCGG 
CACAAGGTGA 
ACACGGTGCC 
GGAGGCAAAC 
AGATCCACAA 
TCTGGGATCT 
AAAGGTTTGG 
CTGAAACCAC 
CCACCTCTCC 
AAATCTAGGA 
GAAACTTCCA 
CCTTTTAATT 
GATGGTTCTT 
TCTTTGTTTC 
TACAATTGGA 
GATGGAATAT 
AAATGGTCTA 
CATCATTTTG 
TGTATGTAGG 



CTGTAAACTG 
ATGAAATCAG 
AATTTTTCCC 
AGGGAACGAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AGACATGTAA 
GAAGGGGACA 
TAGTGATTCA 
TGTATCCTTG 
GCCTGCTAGC 
TTTAAGTTGT 
ACAGTAAGTG 
TGTTAGAGAA 
CTGTATAAAT 
AAAACTAAAT 
ACCATTTTTC 
TTGACTGAGA 
TTTCAAAATA 
ACTTCACTGC 
AGAGGGAAGA 
GGTCTATCTA 
AATAGGACAT 
AAATGTTTTA 
GGGGAGGGTT 
CAGTGTGAGA 
GTTTAGAGAT 
TAGGGCCCTT 
ACCAGAAAAT 
GCCAGATCGC 
ACAAGGTGGT 
ATTCCATTGG 
TGGCATTACT 
ACACATACCA 
TCTTCAAAAA. 
CTAAAACTCC 
CACACAGCCC 
GCAGCCAAAG 
CCGCCTCATG 
CGGGGCTGCA 
CACGGAGCAG 
ACAGATCACC 
AGGAGGCAGG 
GAATTTGAAG 
GTTTTTTTCT 
TTTCTGCTTC 
CTAACCCTGG 
ATAGAAAAGT 
AGGGTGGGGT 
CCTTAGATTT 
TTTTCTTTTT 
AGAGATCACA 
AAGCACTATT 
ATAAATTTGA 
GGGCATCCCA 
CTGGTGCTTA 
TATAGTTTAT 



AATGAATGTT 
AAGTGAAAAT 
TTAGTTAATG 
TTCTTCATGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNTAC 
ATATTTTGGC 
AGGGAAGT GG 
AGTACGTGAA 
GAACATCATT 
AGGGTAACAC 
TCACAACCTG 
CACAGAAATC 
GACACTGGAA 
TAAGCATAAG 
GGGATTACTA 
TATGGAAAAA 
TAAAATAGCC 
CTACTTGCAA 
TGCTTAACCA 
AAACTCTGTA 
AAGTGAAAGT 
ATTTATAGAA 
AATGGCTTAC 
TCCTGAATAC 
AGCTCTGCCA 
T T TT TT* TT T 
TAAATCACCA 
TCTCCCTAGA 
CAGGGGTGTT 
GAATGTGTAT 
AAATATTCAT 
TGTACTTTCC 
TAGAACTTAA 
TCTGGGAATT 
TGTCCATGCT 
TGCTTTTCCC 
CCCCAGGGGG 
CATTCAGGGG 
CTCTGGCACA 
CCCTGTGGGG 
TTGCGAGGGG 
ACCTGGGACG 
AAGCCTTGCT 
GGGCCTGATG 
TTCATCCTTC 
TAAAGATCCA 
TTTTTTGGTT 
GGTGAGCTTA 
CAAACTTTTA 
TGAAATGTTT 
GTTAAGCAGC 
CTATTCAGGA 
TCTTAAGTAA 
ACAAAATTAT 
TGAAACTTGC 
AAGAAAC CAT 
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7 5001 TTTAGAAAGA AAAGATAGAT 
7 5051 GGATTTTGTT TAATTCTTAC 
7 5101 TTTATTGTTT AGAGAAATAA 
7 5151 TACGTTTTCT CTTGTTTTAT 
7 5201 ATGTAAGAAT AATAAAAGTT 
75251 TGATTGGATT ATGGTGTTTG 
7 5301 GTTAACCACA AAGGAACTAA 
75351 ATCTTTCAGA TCTTTGGCTC 
754 01 CTCTTCTAAC TTTGGTAATC 
75451 CCTAAATCGA ATTGGTCTAT 
75501 ATTCTGAGTA ATGTGGTCTG 
75551 GAAAGAGAAG CTTCAGTGAT 
75601 TGCAAAATAA GTTTAAAATT 
75651 CTTAATAAGG AGGCATTAGA 
7 5701 CATTCTCTCC CTAGGTCGTC 
7 57 51 GGGGAGATTG GATTAGATGA 
7 5801 AGAACGCTAA CAACTCCAGA 
75851 CTCTTTTTCC AATTCTCATC 
7 5901 AGGAGAAGAG TGGGTCAGTG 
75951 GTTTTTCTGG CCTGCTTTTG 
7 6001 TGAAATACAG GGGCATCGCA 
7 6051 TTCCATCCTG TGTCTCTTTC 
g 7 6101 CATCCTCCAT CCTGATTACA 
jlj 7 6151 AGTTACGGGA GCAGTTTTTA 
^ 7 6201 ATGTGTTCGG TGTTGTACCA 
W 7 6251 TGTACAATGT GCTTGTGAAA 
%j 7 6301 TGGCCTGACC TGAACTCTGA 
;J|] 7 6351 GGTGTGTAAC ATATCAAGAT 
7 6401 GTTAAAGACA ATTATGTAGA 
7 6451 GTGTTTTGTG TAAATGTGCT 
7 6501 AT TT AC AG AC TCTAGTTGCA 
%| 7 6551 ATACATGGTG AAATGTCATC 
;« 7 6601 GGAAATTGGC AGAAGTGTTG 
7 6651 TTAATGGCCT TGAGTTTGAT 
^ 7 67 01 ACAAGAAACT GCAGTTAATT 
7 6751 AAAACCACAG CTTTTACCAT 
N 7 6801 CTTTATAAAA GTTTTGGTTG 
%l 7 6851 GAGGAAGCAT AGGGGGAAAC 
r] 7 6901 TAAATTGATA TAAGTTTAGG 
76951 GTTGTGCTGT GTCAGGATGG 
m 77 001 TGGGATGTGT TGGAACGGGA 
77 051 GGAATTCTAA GATCCATGAA 
77101 TACCAATATA TTAAAAAACA 
77151 GTTTGTCTTA ATTTCTGATT 
7 7201 CCAGGAATCA CAAAGATGAT 
7 7251 AAAATTAAAG GAACTGGGAT 
7 7301 GCAAACCATT GATGGTTTTC 
7 7 351 GGCGACCAGC TGTTCTCCAT 
774 01 GCTTAGACTA GAGTATAAGG 
7 74 51 ATACTTCATA AAAAAAGAAG 
7 7 501 TAAAAAATTA GATAAAAATT 
77551 ACCCAAGGAA AAGTAACAAA 
7 7 601 CTACTAGTAG TATGCAGTGA 
77 651 GCTTAATTTA GGCATTTCCC 
77 7 01 GGGGAAGAGC CCACCAAATG 
77 751 GCAATCCTAG CAGTATTAAA 
77 8 01 AGTATGATGG ACAGATATTT 
77 851 TGGTTTTCAA TGTTTCTTCA 
77 901 ATGGATGATG TTTCAGATTT 
77 951 CAGTTCAGAG AGATTTTCAT 



TCCATTAATA CAGTTTTTAT CAGAGTACTT 
ATATTTTTTC TTAAAACTTT TCTCAGTATT 
AACAAGATAA TCATTTTAAA TCATAGCACT 
AAGGAGCAAG GATGCTCTAT AGAAAATATA 
TTTGGTTTTT ACATAGGTAA AGCAACAGTG 
ATTCTATTCC ATTTTCAGCA AGAAAGCGAT 
GAAACATTTA AGATAGGCTG TGTGATTATG 
CTAATATCTG TTCCTTTATA TTCTATCACA 
CTTGACAAAA GTGTGCACTT TATAAACAAT 
AGCTTAGAAT GGCTTTTTAA AGAATAATTG 
ATGAACAGTT TGATGACTTC AGTTTCTACT 
ACTGGCAACT ATATTCTGTT TTTTTCCTCC 
GGTTTGGGGG AAGGTTTGCC TTTATTTTTG 
AAGGGGCAGA GGAGGCTTGA CTGGTGTGTG 
TTCAAAGGTG AAAAAAGGCG TGGTCACACT 
TGTGAGCTTG AAAAAAGGCC ACTGCTCTGA 
ACTAACAATG AACTCCTATG TTGCTCTATC 
TTCTCTCCTC TTCTCCCTTT TATCAGGCCT 
GGTCAGAAGG AAGTCTATTT GGTGACCCAG 
TGCAATCCCA ATGAACAGTG ATACCCTCCT 
GACACATCAA AGCCATCTGT GGGTGTTGCC 
AGGAAGGCAT TCAGCATGCG T GAG C CAT AC 
AGGTGCTCCT TGTAGCAAAT TATGAGAGTG 
AAAGAAATCT TTGCAGATGG CTATGATGTT 
TGAGTAGTAT TGACTTCCCT TGAGATATGA 
TTGACTTACC CTCTTCACTT AAGTTAGTTC 
CTTTTACTGC CATTCACTTT ATAAAATAAG 
ACATTTATTT TTATCTGTTT TTTTTTTCCT 
GTGGGCACGT AATCCCTCCT TAGTAGTATT 
ATTGATATTA AGTATTTACA TGTTCCAAAT 
AGGTAAAGGG CAGCTTGTGA TCTCAAAAAA 
CAGTTCCATG ACCTTATATT GGCAGCAGTA 
GGTTGTGGTA ACGGAGTGAT GAATTTTTTT 
CTCTGCAAAG GATAGGAAAC CTTTAGGAAG 
TAGAACTGTC ACTGTTTCAA GTTACACTTT 
CATAACATGG CTCTGGTAAT ATGTAGGAAG 
ATTCAGAAAA AGGATCCTGT TGCAGAGTGA 
TCCATTGGAA CAGATTTTCA CACAACGTTT 
CAGTTGTAGT TCATAACTTA TGTTGCTCAT 
GATAGGAAGC AAGTCCCATG CTTAGAGGCA 
TTTACACACA CTGGAGGAGC AGGGCAAGTT 
CCCCCAACTG TATTTCCTCC CTGCATATTT 
ATGTAACTTT TAAAAGGCAT CATTCCTGAG 
AAGTAATCAG AATATTTTCT GCTGTTTTTG 
TAAAGGGTTG GAAAAAAAGA TCTATGATGG 
TATTGAGCCT GGAGAAGAGA AGACTGAGGG 
AAGTATATGA AGGGTTGGCA CAGAGAGGGT 
ATGCACTAAG AATAGAACAA GAGGAAACTG 
GAGCATTTCT TGGCAGGGGC CATTGTTAGA 
TGTGAAAATC TCAGTATCTC TCTCTCTTTC 
TGTCTATTTA AGATGGTTAA AGATGTTCTT 
TTATAGAATT TCCCAAAAGA TGTTTTGATC 
AAATCTTTAG AACTAAATAA TTTGGACAAG 
TCTTGACCTC CTAATGGAGA GGGATTGAAA 
CTGAGCTCAC TGAAATATCT CTCCCTTATG 
GAAAAAAGGA AACTATTTAT TCCAAATGAG 
TAGTATCTCA GTAATGTCCT AGTGTGGCGG 
TGGTAAAGGT ATAAGCCTTT CATTTGTTCA 
TTTTTTTTTT AAGAGATCCT TCAAGGAACA 
CGGGTGCATT CTCTCTGCTT CGTGTGTGAC 
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78001 AAGTTATCTT GGCTGCTGAG AAAGAGTGCC CTGCCCCACA CCGGCAGACC 
78051 TTTCCTTCAC CTCATCAGTA TGATTCAGTT TCTCTTATCA ATTGGACTCT 
78101 CCCAGGTTCC ACAGAACAGT AATATTTTTT GAACAATAGG TACAATAGAA 
7 8151 GGTCTTCTGT CATTTAACCT GGTAAAGGCA GGGCTGGAGG GGGAAAATAA 
78201 ATCATTAAGC CTTTGAGTAA CGGCAGAATA TATGGCTGTA GATCCATTTT 
7 82 51 TAATGGTTCA TTTCCTTTAT GGTCATATAA CTGCACAGCT GAAGATGAAA 
7 8301 GGGGAAAATA AATGAAAATT TTACTTTTCG ATGCCAATGA TACATTGCAC 
7 8351 TAAACTGATG GAAGAAGTTA TCCAAAGTAC TGTATAACAT CTTGTTTATT 
7 8401 ATTTAATGTT TTCTAAAATA AAAAATGTTA GTGGTTTTCC AAATGGCCTA 
7 8451 ATAAAAACAA TTATTTGTAA ATAAAAACAC TGTTAGTAAT ACCAGTTGTC 
78501 TATTCTTGTT TTTTGAGTTT TGTTTTTTTT TGACTTGGAA AAAAGCATTG 
78551 AGGTAGTTAA ATGATGTTTC ACAAAAGTCA TAGTAGAATC CCTTTTACTG 
78601 TTTGGATGGT GGGAACAAAG ATGTTGCCTG CAGTATTATA CTTTCTAGGT 
78651 TATAAAACAT GAGACACTTT ATTTTTTTTA TCAGCATGAA CAGGGAAAGA 
7 8701 GATCAGAAGA TCACTATAAC CCATGCCATG C CT T AG T AAA TTGCTTTAGT 
78751 TATGTTTTAT TATCATTTCA TTGTAAACAT TTGCT (SEQ ID NO: 3) 



FEATURES: 






Genewise 


results : 


Start: 


3000 




Exon : 


3000- 


3070 


Exon : 


5248- 


5348 


Exon : 


19533 


-19583 


Exon : 


34426 


-34518 


Exon : 


39701 


-39790 


Exon : 


44102 


-44221 


Exon : 


45394 


-45513 


Exon: 


47168 


-47302 


Exon : 


47607 


-47729 


Exon : 


49126 


-49127 


Exon : 


49132 


-49522 


Exon : 


65277 


-65363 


Exon : 


65834 


-65932 


Exon : 


74008 


-74264 


Exon: 


75715 


-75807 


Stop: 


75808 





CHROMOSOME MAP POSITION: 

pi* chromosome 4 



ALLELIC VARIANTS (SNPs) : 

DNA 

Position Major Minor Domain 

3365 C T 

3411 G A 

5243 G A 

6532 T G 

9208 - G 

15833 T C 

27795 G T 

31341 A G 

33370 G A 

37788 G A 

41465 C T 

41466 A G 
41653 T G 
47666 G A 
52613 G A 



Protein 

Position Major Minor 



Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 
Intron 

Exon, coding 281 
Intron 



FIGURE 3AA 



52645 


G 


A 


In "cron 


59197 




A 


Intron 


63508 


T 


C 


Intron 


75153 


C 


G 


Intron 


76017 


c 


G 


Intron 


76033 


c 


T 


Intron 


77194 


A 


G 


Intron 


77508 


T 


A 


Intron 


77557 


G 


A 


Intron 



Context : 



DNA 

Position 

3365 CGGGAGGTGAGCTGGGCCCCGGGGCGCCCTCTCCTCCTTCCCGCGCTAATTTCACACTCA 
CTGTCTTGGGTCACTTTTCCCCGCGGGGTTTCGTGGTCAGAGAGGCGTCTCCTCCATCCA 
GAAGTTGGGCCACCGCACAGCGTGGCGCGAGGAGAGCGGTCCAGCGGCTCCGAGTGCCCG 
CCCGAGGCGGAGAGGGCGCGCCCTTGCGAGTCTGGGACCCCATCCGCGGCCCCCCGAGGG 
CGACTCGCCCCGGCTCGGGAATTAGGACTGAGGGAGAGGAGCCGCTGGAGCCTGGGATCT 
[C,T] 

GGCTCTGAGGGCGCGGTTTAGCCACCTACGCCGAGGTGACGCGCGAAACATCCCTTACCC 
GGGAAACTCCCGCGCCTGAACTAGACGGCTCTTCACTGGGGAAGCTTCCAGGCCCCCGGG 
GGGAGGCCCGGGCTCTGCTCAGGGCTCTCGGGGCCGCTCACACAGAGAGTGGGTGCGAGT 
CAGCGACTGGGCTACGGGGGAGATTTGTGGGCCTCTCCATTTGGTTTTCTTGAGGGAAGG 
AGACTCAAAATGAGGACCGGAGGGTGGGCGCTCCGTGAATGTGAGCATGAGTGTGTGGAT 

3411 TAATTTCACACTCACTGTCTTGGGTCACTTTTCCCCGCGGGGTTTCGTGGTCAGAGAGGC 
GTCTCCTCCATCCAGAAGTTGGGCCACCGCACAGCGTGGCGCGAGGAGAGCGGTCCAGCG 
GCTCCGAGTGCCCGCCCGAGGCGGAGAGGGCGCGCCCTTGCGAGTCTGGGACCCCATCCG 
CGGCCCCCCGAGGGCGACTCGCCCCGGCTCGGGAATTAGGACTGAGGGAGAGGAGCCGCT 
GGAGCCTGGGATCTCGGCTCTGAGGGCGCGGTTTAGCCACCTACGCCGAGGTGACGCGCG 
[G,A] 

AACATCCCTTACCCGGGAAACTCCCGCGCCTGAACTAGACGGCTCTTCACTGGGGAAGCT 
TCCAGGCCCCCGGGGGGAGGCCCGGGCTCTGCTCAGGGCTCTCGGGGCCGCTCACACAGA 
GAGTGGGTGCGAGTCAGCGACTGGGCTACGGGGGAGATTTGTGGGCCTCTCCATTTGGTT 
TTCTTGAGGGAAGGAGACTCAAAATGAGGACCGGAGGGTGGGCGCTCCGTGAATGTGAGC 
ATGAGTGTGTGGATGTGTGTGTGAGAGCGCACACACTGCGCCGCTCCTCAGACTCGGGCG 

52 4 3 GGCAGCCTTGAGAATTGTTGGTGTTAGCTGGTGCTATTTCTACTTAAGAAAGCATCTTTC 
CCTCCCGTTTTTTCTCCTACCCTTGTGTGTTGGGGGATGGGGTATTTAACTGTCTGAAAT 
TTGACATTAGATCATAGCAGAACTATTTCTGTGAAGGTGTTTTAACCTTAAATTTCTGCA 
GGCATAAAAGAGTTTGTAGAAAATTGTTGGGTGTGGTTGACATTTTTTCTGAAGTAATAA 
TACAGAGGAAATTACCTTTCTTCTCAAAAGTGTCCTTATTTATTTTGAATCTTTTTTTTT 
EG, A] 

GTAGGTGGCCCAGGCAAATAGTGTCATCGATTGGCCTATGTCGTTATGGTGGGAGGATTG 
ACTGCTGCTGGGGCTGGGCTCGCCAGTCTTGGGGACAGTGTCAGCGTGAGTATCAAGCCT 
GGGGACTTCAGTTCCCTGGGAGGTGTGGCTTTCCACCTTGTTCATGGCTTCACCCCACAT 
ATCAGAGGGTTCATTACTGAGCAAGGCTTGGCCTTGCAGGTCTGACTTGGGGATTTTCAG 
GTACAGTCCAGACTCCTTATTCTGCTTCTTTTCAGCTTTAGCCACCTGTATTACGGCCCA 

6532 GGGCGGGGGTAGGGGGCACTTCTGGTAAGATGATGGGAACTAAGTTGGGTTCTACATTGG 
GATATATATTTTATTGCTAATGAGGAGGAGGCTTAGAGGAAGAGAGAAGGGCAGTTACGA 
AGGCTAGAGCTGGCAATGGAGAAGCCTGCCTTAGAGATGGGTTGCTAGTGTGAGGAGTCA 
GGCAAATTTAAGTTCAGGAAAGTTAGGAGTTCCCTCTGCTATTTTAATTTTTGAGGATGC 
TTGCAATGTCTTCCTTAATTTTGTGAAAGAGGGACAGTGACAGTCACAGATTGACTCTAA 
[T,G] 

TGCACATAAAGACCACAATCTCTGGTTGGGAATAGAAAGGTAAAGGAAATGAATGTTTGC 
CTACCTGGTATGGAATTTGAGAACCAACAGATTCTAATAACCAAAATGTGAAGAAAGGAC 
CCTTCTGTTGGCCCAACACACCTACACATAACCCTCCTGAGTGAAAAATGAGTAGTTCTA 
TACCTGCAGTCTCCAGCTGTGCAAATACTTCTGATACTACAGAAGACTAAATTCCACCAG 
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GCACCATTCTTCTTTTTTGATCATCTTCCCTTAAAACAATATTGAATAGACTAACCAGTG 

9208 TCTGAGTTCTTTGGCTGTCATTGAGCAACTTGTGGTTTCTGAAGGAAATTATGTGAATTA 
GGATGGTTTTGTATCATTTATCCTTAAGAACAGGGAAAATTGAGATGTTTTCTTATGTTT 
CTGCTGGAGATTTTGGAAAGATGTGAAACCTACACCTACAGATTGACCTTGCTTAGTTAG 
CTCTGAACCTCCTGCTGCCTCTTCCACGTAAAGTGAAAATTTTGGATTCTTATCGGCTTC 
AGATAAACTTACAGGTTAGTGAAACATAGGGACTGAGATATAGTAATTCATTCTGAAGCT 
[~,G] 

TTTTGGAGTGGTCAAATAATTTTAGTTGGATAATATATATTACTGGCTAATGATTGTGGA 
TATTGGAAGTGATGAAAAAATTATTGAATTATTTCTTTCTGCATTTCAAATGAAAAGGCT 
ATTAGTTTGAGCAGAGAATTTTGATTTAGTAAACAAAATATTTAAATTTCATGTTTCATT 
TCTTTCTCCTATCTGGGTTCAGATACTCAGTCTTATAAATGGAACATGATTTATTTTTGC 
TCCCTAAACTGGTTATTAACTTCCTGTCCATAATCACAAAACTATATAGATTATATATTT 

15833 CATACATTAGCTTTGGTAGATACTAGATGCCAGGGGAGTGCAAATTTGAGAATAAGGATT 
TGGCATGGGTTATTTGCATGTTGAGAGTCAAGATTTAACTAAATTTTAAAACCAAACACT 
TCATTTGATCAATATCTTTTTAACCATTCTGTAGATTAATAATATAMTTCTCCAGAGCT 
GACATTACTTTGCTATAACATCATCAGATCACAAGATTAGGGTTGCCTTTTGTAGATGTT 
ATTCACTTACTCTGAAATGTTAGGAAATATGTCATCACAGTTTAAATTTGTAGTAATATA 
[T,C] 

ACAAAAGGAAAACACAACTAGGAATTTTGGATTTATGCTTACTTTGCCAAAAACCATGTT 
GATTTTCAAAAACCTTTAGCC 

2 77 95 ATGAAAACAGAAAGAGAAAGTTAAAATTTCAAGTAAACTGAAAACAACAATAGTGCTCAG 
AGGATTTTTTTTTAAGTGAAAAGGGATAGTGCTTAATTATGACAAATAAAAGCTAACTTG 
AGATGCACATATACAGATGCAGCCACTTATTTTGGCGGGGGACACTTCAGGAATTAAAAT 
TTAAATAGCGAGAGATCAAATAGATACTTGGTAAATGTGTCTGAATTGGATGTTCCCAGA 
CACAAAAATAAAATGAGTTATTGACAGCTCTTGGGAGACAACATTATAAAGACTAGACAC 
[G,T] 

TTATTTATTTTAACTCTATGTTCTAAATTACCATTGAGTAATTGACATTCGTATTTGACT 
ATGGTTTGTGGTTAAGTTCTTAATTGCAATAATGTTAAATAAAATGTGAAGCCCAAAGCA 
AACAACAACAAAAATTATAGCAATACTTCAACAGAGGTAATAATAATATGCTGCATCAAT 
GGTTCAGAATCCAGCATCTACATAAAACAAGCAACAGGGTAATGAAATTATTTTCTTTTC 
AAATATTCTGGCAGAGCTACTTTAGTTTTCTTAAGTTATAGATTGTGGTCTTAACTGCAA 

31341 CCACTTTGCTTATATTCTGAAAGTCTTTTGTTTCCTATTCCACTGCTCTTACCTCTAACA 
CACTGCCTCTAACACACCAACCTGCAGTTGTAGTTATTACACACCCTCCTTGGTTCTTTC 
ATCTCTCTATCACAGCCCTTGTTGTGGTTTAGCCAATATATTTTAGTTCCACAGCTAAAT 
TTTCATACCCTCTATGACTCTCTAATCCCCTGCCACACTTGCCTACTATAATACATTATA 
TATATAACAAATGTTTGATACGTATTTATTGAATTCCATTCCAGAACTAATGCCAGCAAG 
[A, G] 

TAACTTTGTGCTATATAGGAGAATATCTTTTTGTGCAACAGTTTCCAAAGGGTTTTCTTT 
TTCTAAGAAGAAAGAAATTGATTGTATCAACTTTATGAGTATCCTACCGCATTTAATAGC 
CATTGGCTAATCTAAGGGTTCCTGGTTACTTCACTGAATAGCCTATCAGATGGAAGTGCA 
AACAACAGTTTGTTTTGAAATAGGACTCCCTAAACATGGAAGAAACATTAACAGTGTTGG 
CCTGTTGGAATGTGTGCATTTGATGTGCTCAAGATTAGGGCACTCTGCTTGAGAACAAAT 

3337 0 CCACTCACCTGATTGTTTCTCACTCTTCCTCCACCATTTACCCATCTCTTTAGCTTAATG 
AGTCCCCTGTGTATCTCCCAACTAAACAGCGGCTTACTTGCCTGTGAAATATTCTTCTCT 
TGGGTAGTCTGCTCCCTTCTCTGTCTACTCATGCTTCAAGATTCAACATAAGCCTCCTCT 
ATGAGGCTTTCTGCACGTATGTATATGGATTTGCTTGTGTAATGATTTCTTCACAGATTT 
CATATTGCTGATAAATAAATATTGTTTTGAATAAGAAACGTGGTTTTGTATTTTTATCTC 
[G, A] 

ATTGTAGACTCCTTGAGACCAGTACCATGCTATACAATTATTTTTCATCTATTATAGTGT 
CTGGCATAGGGACATGCACATATTTGGTACAGAA 

37 78 8 GCCCTGTCAGGATCAGGGAAGCTGCCATGTGTGGCCTGGTTGAGTGTGAGAGCTGCCTAG 
AGACTTCCATAAAAGTTGTTAGAGAAAATTGTTGATAGTGCCTACATAGCAGATTAACTT 
AAACTGTTTTCATTCAGGCCCACACATTTTAATAAAGTAGAAAATATGCTTCACAGATAA 
GGGAAATCAAACAGGCTCCTTTTTTCTGGAGGAGAGAAATGTCAAAAAGAATTAAATTTG 
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AAATAACTTTACAGAACTGGAAATTAGCTTTTGATTAAAAGTAGCTTTTGGTATATGACA 
[G,A] 

GTATTCACTGAGAATTTTGTAGCGAGTTATATACTTTAAGAAATAACCCCCAGAAACTTG 
CATCATGGTGTAAACAGCTTGAATAAACAAGTGCTTAACCAGTGCCTTTAGAGCTGCCTG 
GGAAACAGCCAGAATACCAGGGCAAGCTGCATTTTGGAACTGGTTTAATTTAGTAGCCTT 
GCCACAGGCTTAGTGTGATCTGCTTTTGGTGGCTTGATCTTCCCCACTAAGTCATTTTCT 
GGATTTGTTACACCTAGAACTGTTAGGAAATTACAGGCTTGGGCTGATCATTAACATACT 

414 65 TTTATTTCTAGGTTCTTTTATCAACTTTTAAGATTTATTTGAGACAGTATGATCAATGAC 
TTCATTTTGTTCTGCTTATTATTGTAGGAGTATTTACTATAATTTGGAAGTAATTTATTT 
TTGAATTTATTGCTTAATTGAATGATCTCCAATAGATTGTGATAATGAACACAGCATTTA 
TAGAAAGCAGCACATATTAACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGTA 
ACATAATTGAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGA 
[C,T] 

ACCAGATGCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAA 
TTTTGATAAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATA 
AGGGTCATTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAG 
TCACTAGTGATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCT 
GGTAGAGTACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATA 

414 66 TTATTTCTAGGTTCTTTTATCAACTTTTAAGATTTATTTGAGACAGTATGATCAATGACT 
TCATTTTGTTCTGCTTATTATTGTAGGAGTATTTACTATAATTTGGAAGTAATTTATTTT 
TGAATTTATTGCTTAATTGAATGATCTCCAATAGATTGTGATAATGAACACAGCATTTAT 
AGAAAGCAGCACATATTAACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGTAA 
CATAATTGAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGAC 
[A,G] 

CCAGATGCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAAT 
TTTGATAAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATAA 
GGGTCATTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAGT 
CACTAGTGATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCTG 
GTAGAGTACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATAA 

41653 AGCACATATTAACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGTAACATAATT 
GAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGACACCAGAT 
GCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAATTTTGAT 
AAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATAAGGGTCA 
TTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAGTCACTAG 
[T,G] 

GATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCTGGTAGAGT 
ACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATAACAGGTAA 
AATAGCCAGTACATCATTTCCACATTTACCCATTGAATGTTGCATGTTTTCTTCCTTTCA 
CATATTCATACAGTCCAGATTTTTTTTTGGACTCATGACAGCACATTGGCTTTTCTTTCC 
TTTCAGTTTCATGATTCTTAACCCCAAAGTGCTTTTGCCATGGGAACGGAAGGATAAATT 

47 666 GTGGTGATGGCTGGAATGTCAGGGGCAGGGGAGAGTACTGGCGTTAAGTTAAACCAACAG 
ACATCCAGTTTAACCACTGGTAGTTCTCAGTCTACATGTAGTTTATTTCTTCTGTTTATC 
TGCCAATTTTATGTAGATCATCACATTGCCAAAAAAAATCATTTTTGAAACTGTATATAT 
TTTTTATGTCATCATATTTATCTCCTAAATAAGTCTCTTCTTTTCCTACTTTCTGATGCA 

GACATAGACGAATGCTCACTTGGTCAGTATCAGTGCAGCAGCTTTGCTCGATGTTATAAC 
[ G , A ] 

TACGTGGGTCCTACAAGTGCAAATGTAAAGAAGGATACCAGGGTGATGGACTGACTTGTG 
TGTGTGAGTAGCACTTGTCTCTCAGCTTTAAATTCTAGCAGGAAATACAGGATTACACAA 
AGGCCATTGCTAGGGAAAAT7VAGGAATAAGATTATCAAAGAAGTATAATTGTCATAATTG 
GTTATATTTGTCTTTGATTTCCACAAACAATAAAATCACTTGCTCAGGTACTTGTAAAAA 
CTAAGGACTCAGTAATACACTATAATCTTAAGAGTATTTTAATCTCTTCACTGAAATCTC 

52 613 AGGTAATGGGACCTCTATCACCTGTTGCTAAGGTCTAATAATGTGGATTCATCCTTAATT 
ACTCTTCATACATACAGT CAAT ATGT AAGGAAAT CCTGTT AGCTGT ACCTT CAAATT ATA 
TTTAAGTGTGACCTTTTCTCACCAACTCCAGTGCTACCACCCTGGCCCAAGCCATCTCTC 
CCCTGGAATACAGGCCCAAAATCCTTTATCCAAATAACTTATGGAATATAGCATTTTTTA 



FIGURE 3DD 



GATTTTAGAAAATCAGTAAGGTACACATACTATATAGTACACACTGAAGTAGTGAAACAT 
[ G / A ] 

CTGATTTTCCTCTAGTGCTTTTACTGTGAACGTATCAATGTTAAGGAAAGGAAAATGATA 
TTAACTCAGAGATGATGTCTCACAGCATATATTTACTAGCTTGCACAAATTTTTTAATGT 
TAGCAAGATTTAAGGCAAATTTTTTACTTATATTTTAATTGGATCCTATGATGATTATTA 
AAGAAAAAAGTAGTTATCTCTAGAAAGTATAAATTGAGCTCTTGGCAAATGTGAGCAAAA 
CCAGAAATCAGATTTTTTTAAAGTTACGTGTACATTTGTCTATAGAGTTATAATTAAAAG 

5264 5 GTCTAATAATGTGGATTCATCCTTAATTACTCTTCATACATACAGTCAATATGTAAGGAA 
ATCCTGTTAGCTGTACCTTCAAATTATATTTAAGTGTGACCTTTTCTCACCAACTCCAGT 
GCTACCACCCTGGCCCAAGCCATCTCTCCCCTGGAATACAGGCCCAAAATCCTTTATCCA 
AATAACTTATGGAATATAGCATTTTTTAGATTTTAGAAAATCAGTAAGGTACACATACTA 
TATAGTACACACTGAAGTAGTGAAACATGCTGATTTTCCTCTAGTGCTTTTACTGTGAAC 
[G,A] 

TATCAATGTTAAGGAAAGGAAAATGATATTAACTCAGAGATGATGTCTCACAGCATATAT 
TTACTAGCTTGCACAAATTTTTTAATGTTAGCAAGATTTAAGGCAAATTTTTTACTTATA 
TTTTAATTGGATCCTATGATGATTATTAAAGAAAAAAGTAGTTATCTCTAGAAAGTATAA 
ATTGAGCTCTTGGCAAATGTGAGCAAAACCAGAAATCAGATTTTTTTAAAGTTACGTGTA 
CATTTGTCTATAGAGTTATAATTAAAAGTATTGTGCTCATAGCCAGTCTGTGATCTGGGG 

5 9197 CTACTATTGATGTTGAATTCATTTATATAGTAATGATAACATTTCCTACTTAATTCATAA 
AAAGACAGCCTATGCTGTTTTCTTGTTCTGAGTTTATATGTTTCTCATGCTTTTTATTAT 
GGTTCATTACAATTTTAATGTTATTTTTAACTAACTAGATCCTTTTGAAACAAATTGGTT 
TGCAAGTGTGAGCTGTTAGGTGCACAGAGAAAAATGAAAATAGAAACTTGCGATTTTATT 
CTAGGCTTGTTACCAAATATTTAGAATACTGTGTTTTATTTAGGTGTTTATAGTCTCATT 
[~,A] 

GACAGTTGTGATTTTAAAATAGAGACCACATCATCTCAACTTCTTTACTGTGAAAATAAT 
GACAATAGTCTTTTCAGAGATGAATCTGTCTAGATGGGAAATTTACATGATTGATCTGAT 
GAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
AAAATAATCTGACAAGTAGTTTCCCCAGAAAATCTGATTTAGTAAATGTACCAAAAGGAT 
TTAGAAATCTACATCATAAATAAACATTCTATGTTATTTTAGTTCAGACCCTATTTTAAT 

63508 ACAGGCTCCCGGTGCCAGGAAACCTTTACATCAACACTCGATTTGCCATTTGATAGTCCT 
TCATCTGGGAGGAAAAAAAAAGACGGAGGGGAGCTTGAAAAACTGTCATAATGTCCCTGG 
AATATGGTACTTTTAAGAGTTGAGCCTATTCCATTTTGGAGATGATTTATATAAGTTACA 
ACAAAAGAAGGGGACAAAAACATGATTGTTCTATGGAGTTTTTATAACTTTCTGTCACAA 
GAAAGCACGCTTGTCTACAATTTTGTAATATTTCTAGTAAATAAAAGAGGCACTCCCCGT 
[T,C] 

TCAGAGCACCAAATAAGGAAAGTGTAATTGGATGTCATTGCTGTCAGTCAGCTGGGCTAT 
AAAAGAGAGAGTGGGGTTGCCTCATCCCCTGGGTATCCACAGTCAGCTGTGTCCCTAGAG 
CTTCTTTTCTTTCATTGCTGCCCAGCTGGGTATATTGCAAGTATGGATTATAAGAGGGGA 
AGGGACTTCACTGTTTTAACGTTTGAAACAAAAAGGAAAAAACTCAGAAGTAGTAAGCTA 
AAAACAACTTGTGCAAACGTTCTGGGATTATTACTTAATTTTAAGAATTTTTGCTAAAAA 

75153 AAGACATAGGTAGCAAAAAAATGGTCTAGGGCATCCCAACAAAATTATGAATGCATTTGG 
GAAAATGTCAT CATTTTGCTGGTGCTTATGAAACTTGCTGTTGGGAGATACCTATGCCTG 
TATGTAGGTATAGTTTATAAGAAACCATTTTAGAAAGAAAAGATAGATTCCATTAATACA 
GTTTTTATCAGAGTACTTGGATTTTGTTTAATTCTTACATATTTTTTCTTAAAACTTTTC 
TCAGTATTTTTATTGTTTAGAGAAATAAAACAAGATAATCATTTTAAATCATAGCACTTA 
[C,G] 

GTTTTCTCTTGTTTTATAAGGAGCAAGGATGCTCTATAGAAAATATAATGTAAGAATAAT 
AAAAGTTTTTGGTTTTTACATAGGTAAAGCAACAGTGTGATTGGATTATGGTGTTTGATT 
CTATTCCATTTTCAGCAAGAAAGCGATGTTAACCACAAAGGAACTAAGAAACATTTAAGA 
TAGGCTGTGTGATTATGATCTTTCAGATCTTTGGCTCCTAATATCTGTTCCTTTATATTC 
TATCACACTCTTCTAACTTTGGTAATCCTTGACAAAAGTGTGCACTTTATAAACAATCCT 

7 6017 CGTCTTCAAAGGTGAAAAAAGGCGTGGTCACACTGGGGAGATTGGATTAGATGATGTGAG 
CTTGAAAAAAGGCCACTGCTCTGAAGAACGCTAACAACTCCAGAACTAACAATGAACTCC 
TATGTTGCTCTATCCTCTTTTTCCAATTCTCATCTTCTCTCCTCTTCTCCCTTTTATCAG 
GCCTAGGAGAAGAGTGGGTCAGTGGGTCAGAAGGAAGTCTATTTGGTGACCCAGGTTTTT 



FIGURE 3 EE 



CTGGCCTGCTTTTGTGCAATCCCAATGAACAGTGATACCCTCCTTGAAATACAGGGGCAT 
[C,G] 

GCAGACACATCAAAGCCATCTGTGGGTGTTGCCTTCCATCCTGTGTCTCTTTCAGGAAGG 
CATTCAGCATGCGTGAGCCATACCATCCTCCATCCTGATTACAAGGTGCTCCTTGTAGCA 
AATTATGAGAGTGAGTTACGGGAGCAGTTTTTAAAAGAAATCTTTGCAGATGGCTATGAT 
GTTATGTGTTCGGTGTTGTACCATGAGTAGTATTGACTTCCCTTGAGATATGATGTACAA 
TGTGCTTGTGAAATTGACTTACCCTCTTCACTTAAGTTAGTTCTGGCCTGACCTGAACTC 

7 6033 AAAAGGCGTGGTCACACTGGGGAGATTGGATTAGATGATGTGAGCTTGAAAAAAGGCCAC 
TGCTCTGAAGAACGCTAACAACTCCAGAACTAACAATGAACTCCTATGTTGCTCTATCCT 
CTTTTTCCAATTCTCATCTTCTCTCCTCTTCTCCCTTTTATCAGGCCTAGGAGAAGAGTG 
GGTCAGTGGGTCAGAAGGAAGTCTATTTGGTGACCCAGGTTTTTCTGGCCTGCTTTTGTG 
CAATCCCAATGAACAGTGATACCCTCCTTGAAATACAGGGGCATCGCAGACACATCAAAG 
[C,T] 

CATCTGTGGGTGTTGCCTTCCATCCTGTGTCTCTTTCAGGAAGGCATTCAGCATGCGTGA 
GCCATACCATCCTCCATCCTGATTACAAGGTGCTCCTTGTAGCAAATTATGAGAGTGAGT 
TACGGGAGCAGTTTTTAAAAGAAATCTTTGCAGATGGCTATGATGTTATGTGTTCGGTGT 
TGTACCATGAGTAGTATTGACTTCCCTTGAGATATGATGTACAATGTGCTTGTGAAATTG 
ACTTACCCTCTTCACTTAAGTTAGTTCTGGCCTGACCTGAACTCTGACTTTTACTGCCAT 

7 7194 AACGTTTTAAATTGATATAAGTTTAGGCAGTTGTAGTTCATAACTTATGTTGCTCATGTT 
GTGCTGTGTCAGGATGGGATAGGAAGCAAGTCCCATGCTTAGAGGCATGGGATGTGTTGG 
AACGGGATTTACACACACTGGAGGAGCAGGGCAAGTTGGAATTCTAAGATCCATGAACCC 
CCAACTGTATTTCCTCCCTGCATATTTTACCAATATATTAAAAAACAATGTAACTTTTAA 
AAGGCATCATTCCTGAGGTTTGTCTTAATTTCTGATTAAGTAATCAGAATATTTTCTGCT 
[A,Gj 

TTTTTGCCAGGAATCACAAAGATGATTAAAGGGTTGGAAAAAAAGATCTATGATGGAAAA 
TTAAAGGAACTGGGATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATG 
GTTTTCAAGTATATGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGC 
ACTAAGAATAGAACAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGC 
AGGGGCCATTGTTAGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTC 

77508 TCACAAAGATGATTAAAGGGTTGGAAAAAAAGATCTATGATGGAAAATTAAAGGAACTGG 
GATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATGGTTTTCAAGTATA 
TGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGCACTAAGAATAGAA 
CAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGCAGGGGCCATTGTT 
AGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTCTCTTTCTAAAAAA 
[T,A] 

TAGATAAAAATTTGTCTATTTAAGATGGTTAAAGATGTTCTTACCCAAGGAAAAGTAACA 
AATTATAGAATTTCCCAAAAGATGTTTTGATCCTACTAGTAGTATGCAGTGAAAATCTTT 
AGAACTAAATAATTTGGACAAGGCTTAATTTAGGCATTTCCCTCTTGACCTCCTAATGGA 
GAGGGATTGAAAGGGGAAGAGCCCACCAAATGCTGAGCTCACTGAAATATCTCTCCCTTA 
TGGCAATCCTAGCAGTATTAAAGAAAAAAGGAAACTATTTATTCCAAATGAGAGTATGAT 

7 7557 AAAGGAACTGGGATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATGGT 
TTTCAAGTATATGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGCAC 
TAAGAATAGAACAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGCAG 
GGGCCATTGTTAGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTCTC 
TTTCTAAAAAATTAGATAAAAATTTGTCTATTTAAGATGGTTAAAGATGTTCTTACCCAA 
[G, A] 

GAAAAGTAACAAATTATAGAATTTCCCAAAAGATGTTTTGATCCTACTAGTAGTATGCAG 
TGAAAATCTTTAGAACTAAATAATTTGGACAAGGCTTAATTTAGGCATTTCCCTCTTGAC 
CTCCTAATGGAGAGGGATTGAAAGGGGAAGAGCCCACCAAATGCTGAGCTCACTGAAATA 
TCTCTCCCTTATGGCAATCCTAGCAGTATTAAAGAAAAAAGGAAACTATTTATTCCAAAT 
GAGAGTATGATGGACAGATATTTTAGTATCTCAGTAATGTCCTAGTGTGGCGGTGGTTTT 



FIGURE 3FF 



